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ABSTRACT 

COGNITIVE  STRUCTURE:  A COMPARISON  OF 
TWO  THEORIES  AND  MEASURES  OF 
INTEGRATIVE  COMPLEXITY 

by 

Gary  Bruce  Cox 

This  study  was  intended  to  assess  the  generality  of  a particular  type 
of  cognitive  structure  characteristic,  that  of  integrative  complexity.  Pur- 
suant of  this,  the  theories  of  H.  M.  Schroder  and  O.  J.  Harvey  were  com- 
pared, and  their  respective  measures  administered  to  440  Ss  of  both  sexes 
from  three  different  southern  schools. 

The  theoretical  analysis  suggested  that  Schroder's  theory  is  more 
truly  structural  in  nature,  and  is  more  powerful  in  that  it  is  more  easily 
and  obviously  applicable  to  a broad  range  of  cognitive  domains.  Harvey's 
position  is  much  more  firmly  grounded  on  the  content  of  the  interpersonal 
domain.  Both  theories  claim  that  the  characteristic  of  cognitive  structure 
which  is  most  important  in  determining  cognitive  complexity  is  not  differ - 


(ill) 


entiation,  or  an  increase  in  the  dimensionality,  of  the  cognitive  domain,  but 


the  subsequent  integration  of  the  differentiated  components.  Unfortunately, 
neither  theorist  is  able  to  define  integration  so  as  to  clearly  distinguish  it 
from  a dimensional  position.  Here  again,  however,  Schroder's  theory 
seems  to  be  the  stronger,  since  it  is  at  least  explicit  enough  that  the  locus 
of  difficulty  can  be  precisely  identified.  Further,  even  if  Schroder  is  un- 
able to  define  adequately  the  integration  concept,  his  theorizing  suggests 
the  importance  of  the  possibility  of  super-  and  sub-ordinate  relationships 
among  dimensions. 

Results  of  the  testing  were  as  follows:  (a)  As  expected,  the  respec- 

tive measures  of  cognitive  integration  were  nonsignificantly  correlated  with 
each  other,  (b)  Both  measures  of  integration  were  significantly  correlated 
with  such  measures  of  intelligence  as  vocabulary,  abstract  thinking,  and 
SAT  verbal  and  mathematical  scores,  (c)  Sex  differences  in  the  scores 
may  exist,  although  the  pattern  is  not  clear,  (d)  Other  sample  character- 
istics may  affect  the  distribution  of  scores,  e.g.  large  intelligence  differ- 
ences, socioeconomic  differences,  etc.  On  the  other  hand,  Negroes  are 
not  ipso  facto  inferior  to  Caucasians,  even  when  the  latter  enjoy  a 100- 
point  advantage  on  SAT  averages.  Nor  are  Southern  whites  inferior  to 
Northern  whites,  at  least  when  both  are  of  superior  intellectual  ability. 

(e)  Reliability,  as  estimated  by  coefficient  alpha,  is  satisfactory  for  Har- 
vey's measure  and  unsatisfactory  for  Schroder's,  (f)  The  distribution  of 
scores  is  such  that  for  both  measures  complex  Ss  are  rare,  so  pools  of  Ss 


(iv) 


must  be  tested  in  order  to  obtain  adequate  numbers  of  complex  Ss.  This  is 
more  a problem  in  attempting  to  apply  Schroder  than  Harvey,  largely  be- 
cause  Schroder  has  often  not  bothered  to  study  middle-range  Ss,  so  their 
characteristics  are  unknown.  Schroder's  variable  (especially)  is  essen- 
tially inapplicable  to  an  unscreened  group  of  subjects. 

Overall,  Schroder's  theory  seems  more  promising  than  Harvey's. 
Suggestions  were  made  for  improving  the  reliability  and  distribution  of 
scores.  Additionally,  a translation  of  Schroder's  theory  into  dimensional 
terminology  was  attempted,  and  some  important  implications  of  his  posi- 
tion for  the  dimensional  orientation  were  discussed. 
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COGNITIVE  STRUCTURE:  A COMPARISON  OF  TWO  THEORIES 


AND  MEASURES  OF  INTEGRATIVE  COMPLEXITY 


Chapter  I 


INTRODUCTION 


In  1961  O.  J.  Harvey  and  Ho  M.  Schroder  collaborated,  with  D.  E0 
Hunt,  in  writing  the  book  Conceptual  Systems  and  Personality  Organization 
(Harvey,  Hunt  Schroder,  1 9 6 1 ) 0 Since  that  time  both  Schroder  and  Har- 
vey have  actively  elaborated  the  subject.  This  paper  is  a comparison  of 
their  theories  and  measures  of  cognitive  complexity  (or  conceptual  systems 
to  use  Harvey's  term,  or  conceptual  structure  to  use  Schroder's). 

The  need  for  this  comparison  results  from  the  common  assumption 
that  the  two  men  are  still  dealing  with  the  same  theory,  an  impression 
which  is  strengthened  by  their  tendencies  to  refer  to  and  cite  each  other's 
work  as  support  for  their  own. 

Both  theories  are  concerned  with  the  topic  of  cognitive  complexity  or 
cognitive  structure,  a research  area  which  is  having  considerable  trouble 
deciding  what  it  means,  operationally,  by  its  own  terms.  There  is  some 
question  as  to  whether  the  various  measures  of  cognitive  complexity  are 
related  to  each  other.  This  concern  has  taken  the  form  of  a number  of 
studies  testing  the  "generality  of  cognitive  complexity,  " that  is,  whether 
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various  measures  produce  comparable  results.  The  present  study  may  be 

j 

viewed  as  an  example  of  such  work.  Accordingly,  it  is  important  to  ask 

I 

] 

what  evidence  already  exists  concerning  generality  and  how  it  relates  to  the 

i 

specific  question  of  generality  at  hand.  As  a preliminary  it  is  necessary 
to  understand  something  about  the  theoretical  components  of  cognitive  com- 

i 

plexity.  These  will  be  the  two  functions  of  this  chapter. 


Components  of  Complexity 


The  cognitive  structure  theorists  share  the  opinion  that  in  attempting 
to  explain  and  predict  behavior  it  will  be  fruitful  to  look  at  the  structure  or 
pattern  of,  or  relationships  among,  cognitive  elements  (opinions,  beliefs, 

• ' I 

attitudes,  ideas,  etc.)  in  addition  to  the  content  of  the  elements.  Beyond 

| 

this  basic  orientation  agreement  diminishes.  Generally,  complexity  and/or 
structure  is  regarded  as  being  a function  of  differentiation,  articulation  or 

! 

discrimination,  and  integration  (Bieri,  1961;  Schroder,  Driver  & Streufert, 

i 

I 

1967).  Other  writers  (e.g.  Scott,  1969)  have  culled  additional  structural 
concepts  from  the  literature  ("centrality,  11  etc.),  but  these  will  be  con- 

i 

I 

sidered  here  only  very  briefly. 

j 

Differentiation  may  be  defined  either  in  terms  of  dimensions  (Bieri, 

I 

1955)  or  categories  (Gardner,  1953),  and  in  either  case  represents  the 
number  of  stable,  unique  and  important  units  (dimensions  or  categories)  in 
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a particular  cognitive  realm  (content  area).  Dimensional  differentiation  is 
usually  measured  with  the  Kelly  Role  Repertory  Test  (REP)  (Bieri,  1961; 
Kelly,  1955)  or  with  multidimensional  scaling  (Schroder  et  al.,  1967).  The 
semantic  differential,  with  or  without  factor  analysis,  might  also  be  used, 
Categorical  differentiation  may  be  assessed  using  object-sorting  tasks, 
either  without  replacement  (an  object  may  appear  in  only  one  category-- 
Clayton  &:  Jackson,  1961:  Gardner,  1953)  or  with  replacement  (an  object 
may  appear  in  as  many  categories  as  S wishes --Scott,  1962). 

Unfortunately,  there  is  no  obvious  relationship  between  the  two  types 
of  differentiation,  nor,  for  that  matter,  between  the  two  ways  of  measuring 
categorical  differentiation.  Gardner’s  scoring  procedure  is  unsatisfactory 
(see  Messick  & Kogan,  1963),  and  the  researcher  interested  in  object  sort- 
ing would  probably  be  better  off  with  an  adaption  of  Scott's  (1962)  technique. 
Since  categories  may  be  based  on  either  different  dimensions  or  different 
locations  along  a single  dimension,  some  further  measurement  steps  would 
be  required  to  relate  categories  and  dimensions. 

The  justification  for  the  various  dimensional  measures  is  fairly 
straightforward.  The  REP  is  probably  the  most  commonly  used  instrument 
for  assessing  cognitive  complexity/structure  and  has  been  used  in  so  many 
different  ways  that  a separate  chapter  of  discussion  would  be  necessary  to 
explore  the  subject.  As  a result,  one  must  be  chary  of  interpreting  results 
based  on  the  REP  unless  the  way  the  test  was  used  is  also  known. 


Multidimensional  scaling  is  related  to  dimensionality  in  that  one 
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learns  the  number  of  dimensions  necessary  to  support  the  relationships  re- 
ported by  the  subject.  Due  to  the  differences  in  the  models  underlying  scal- 
ing and  factor  analysis,  it  is  not  appropriate  to  name  the  dimensions  for 
the  former,  although  this  is  typically  the  goal  in  the  latter. 

The  second  property  of  cognitive  structures  is  articulation  (Bieri, 
1961,  and  Schroder  et  al.,  1967,  prefer  the  term  "discrimination"),  which 
means  the  number  of  distinctions  one  makes  along  a particular  dimension. 
Articulation  is  expected  to  vary  from  one  dimension  to  another.  Also,  it 
is  a dimensional  and  not  a categorical  characteristic. 

Perhaps  the  best  measure  (in  terms  of  its  flexibility  and  its  consist- 
ency with  dimensional  techniques)  of  articulation  is  that  of  Carr's  (1965, 
Carr  also  prefers  the  term  "discrimination";  Scott,  1969,  uses  a similar 
measure).  Degree  of  articulation  is  to  some  extent  controlled  in  the  se- 
mantic differential  and  in  most  versions  of  the  REP.  Both  Scott  (1967)  and 
Schroder  et_  ad.  (1967)  suggest  that  variables  such  as  equivalence  range 
(Sloane,  Gorlow  &:  Jackson,  1963)  and  category  width  (Pettigrew,  1958)  may 
be  related  to  articulation.  Scott  leaves  it  with  that,  but  Schroder  et  al. 
attempt  to  incorporate  the  findings  that  abstract  Ss  have  broad  "category 
width"  scores  (e.g.  Clayton  &:  Jackson,  1961;  Pettigrew,  1958).  The  theo- 
retical explanation  for  this  phenomenon  is  that,  given  the  choice  between 
many  narrow  dimensions  or  a few  broad  ones,  the  S with  abstract  schemata 
will  "„  . . lean  toward  greater  integration- -that  is,  greater  range  of  stim- 
uli per  dimension- -and,  hence,  to  fewer  dimensions  ..."  (Schroder  et  al.  , 
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1967,  p.  175) . No  real  explanation  or  theoretical  derivation  is  offered  as 
to  why  integration  is  expected,  nor  is  there  any  specification  of  how  to 
recognize  situations  in  which  discrimination  will  be  evidenced  by  integra- 
tion vs,  those  in  which  it  will  appear  as  more  and  finer  gradations.  What, 
for  example,  should  we  expect  in  the  Scott  (1962,  1969)  sorting  task  which 
restricts  neither  the  size  nor  number  of  groups?  Also,  Messick  and  Kogan 
(1963)  have  shown  that  the  Clayton  and  Jackson  type  of  sorting  results  are 
due  more  to  the  failure  to  categorize  than  to  the  number  of  categories 
formed.  So  far  as  I know,  relationships  between  size  of  true  categories 
(two  or  more  items)  with  other  variables,  a more  relevant  question  and  one 
to  which  Scott's  procedure  is  applicable,  has  not  been  investigated.  Apro- 
pos this  topic,  Wallach^  has  suggested  that  it  is  important  to  distinguish 
situations  in  which  Ss  are  given  a category  and  asked  to  fit  cases  to  it  (as 
Pettigrew,  1958)  from  those  in  which  S is  permitted  to  provide  his  own 
categories  or  dimension  (as  Clayton  & Jackson,  1961).  More  broadly,  the 
important  variable  may  be  whether  S construes  the  situation  as  one  in  which 
he  is  supposed  to  find  similarities  or  one  in  which  he  is  supposed  to  find 
differences,  the  complex  S presumably  being  better  at  either. 

The  third  characteristic  of  cognitive  structures,  and  the  last  which 
we  will  consider,  is  integration.  Although  integration  is  mentioned  by 
other  theorists  (e.g.  Scott,  1969),  Harvey  et  ad.  (1961)  stand  alone  in  the 

^M.  A.  Wallach,  personal  communication. 
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degree  of  emphasis  they  put  upon  it  as  the  most  important  determinant  of 
complexity/abstractness  o Scott's  attempts  (1966,  1969)  to  measure  inte- 
gration  differ  radically  from  those  of  either  Harvey  or  Schroder  and  will 
not  be  discussed  at  length,  (Similarly  for  Zajonc,  I960,  who,  from  Scott's 
viewpoint  may  be  dealing  partially  with  integration.)  The  Schroder/Harvey 
approaches  will  be  dealt  with  more  extensively  in  Chapter  III.  In  passing, 
it  should  be  noted  that  the  differences  in  emphasis  on  integration  are  not  a 
logical  consequence  of  theoretical  differences.  This  is  particularly  true 
for  James  Bieri,  who  has  roots  in  George  Kelly's  theory  of  personal  con- 
structs  (Kelly,  1955),  which  is  strikingly  similar  to,  and  in  some  respects 
more  systematic  than,  Harvey  et  al.  (1961). 


The  Generality  of  Cognitive  Complexity 

Questions  about  the  generality  of  cognitive  complexity  have  two  dis- 
tinguishable components,  related  respectively  to  theory  and  operations. 
Essentially  all  theorists  agree  that  complexity  will  vary  across  cognitive 
domains  or  content  areas  (Bieri,  1961;  Gardner  &:  Schoen,  1962;  Scott, 
1962)  or  within  the  same  content  area  across  time  (Zajonc,  I960)  or  as  a 
function  of  situational  variables  (Schroder  et_al.,  1967).  Beyond  this,  how 
ever,  is  the  methodological  question  of  whether  various  measures  of  com- 
plexity at  a particular  time  and  with  regard  to  a particular  topic  would 


■ 
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order  Ss  similarly. 

Among  measures  of  complexity  which  emphasize  dimensionality  and/ 
or  discrimination,  some  findings  favoring  generality  have  been  reported. 
For  example,  Bieri  and  Blacker  (1956)  found  complexity  as  measured  by 
the  REP  test  and  by  Rorschach  perceptions  to  be  related.  Allard  and  Carl** 
son  (1963)  administered  three  different  REP  tests,  which  correlated  at 
moderately  high  levels,  Hess  (1966)  compared  REP  complexity  and  per- 
formance on  a sorting  task  and  reported  that  the  two  were  related,  Todd 
and  Rappoport  (1964)  discovered  that  two  different  models  of  cognitive 
structure  (factor  analysis  and  Hays'  implication  model)  produced  highly 
similar  interstimulus  distances  which,  in  turn,  were  related  to  sorting 
performance.  Since  these  findings  are  somewhat  tangential  to  our  major 
interests,  no  particular  effort  will  be  made  to  evaluate  them. 

More  immediately  relevant  results  which  favor  generality  are  re- 
ported by  Carr  (1965)  who  states  that  conceptual  system  level  is  related  to 
discrimination,  but  not  to  REP  fluency,  and  Harvey's  (1966)  and  Wolfe's 
(1963)  findings  of  relationships  between  REP  scores  (not  fluency)  and  con- 
ceptual level. 

I know  of  three  studies  of  generality  which  are  more  extensive  in 
scope  than  the  above.  Of  these,  Scott's  (1969)  is  not  particularly  relevant 
to  the  immediate  purpose  here,  because  his  theoretical  orientation  is 
dimensional,  and  he  includes  neither  Schroder's  nor  Harvey's  measure  of 
integration.  In  addition  to  dimensionality  and  articulation,  Scott  does 


r 
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measure  what  he  defines  as  integrative  characteristics  (Centrality,  Affec- 

j 

tive  Balance,  and  Affective -Evaluative  Consistency),  but  each  of  these  im- 
plies a reduction  in  the  dimensionality  of  the  cognitive  space,  a position  very 
different  from  that  of  Schroder  or  Harvey,  Scott  measured  each  of  the  five 
characteristics  above  in  several  different  ways  and  found  modest  intra- 
characteristic correlations  across  the  various  scores.  The  characteristics 
correlated  with  each  other  in  the  predicted  pattern,  but  this  evidence  for 
generality  within  Scott's  model  does  not  bear  on  the  question  of  the  gener- 
ality of  Schroder's  and  Harvey's  measures  of  integrative  complexity. 

The  best-known  study  of  generality  is  Vannoy's  (1965)  article  based 
on  his  dissertation  (1964).  Vannoy  includes  Schroder's  Paragraph  Comple- 
; tion  Test  (PCT),  but  not  Harvey's  "This  I Believe"  (TIB)  Test  (both  the 
PCT  and  the  TIB  are  intended  to  measure  integrative  complexity),  plus  a 

I 

i number  of  other  tests  of  varying  and  sometimes  uncertain  relation  to  com- 
plexity. He  concluded  that  complexity  is  not  unitary  but  comprises  two  or 

| 

three  uncorrelated  components.  Unfortunately,  the  study  contains  several 
serious  flaws.  No  one  claims  that  complexity  is  unitary.  As  I have  noted, 
most  theorists  hypothesize  components  of  complexity,  and  most  suggest  that 
complexity  will  vary  with  content.  Any  adequate  examination  of  the  gener- 
ality  issue  would  have  to  be  organized  around  these  points,  and  Vannoy's 
failure  to  keep  them  in  mind  has  resulted  in  an  interpretive  jungle.  First, 
not  all  of  the  tests  he  used  are  primarily  intended  to  measure  complexity. 

A case  can  be  made  that  the  scores  might  be  related  to  complexity,  but  how 
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much  and  in  what  ways  is  not  clear.  For  some  measures,  viz.,  the  "social 
distance"  questionnaire,  the  scores  as  used  (variance  components)  are  ab- 
solutely ambiguous.  Second,  the  cognitive  domain  varies  among  tests.  To 
the  extent  that  there  is  a consistent  domain,  it  would  be  interpersonal,  but 
even  this  rubric  incorporates  a broad  range  of  types  of  stimuli,  varying 
from  four  adjective  descriptions  of  hypothetical  people  to  real  persons  nomi- 
nated by  the  Ss  to  fill  particular  roles.  Clearly,  some  of  the  measures  in- 
cluded should  not  have  been,  and  some  important  procedures  were  left  out. 

Of  most  immediate  interest  here  is  the  finding  that  Schroder's  PCT 
loads  highly  on  the  last  of  Yannoy's  factors,  is  not  accompanied  on  that  fac- 
tor by  any  other  complexity  measure  (SCAT  -Q  is  second  highest)  and  shows 
no  other  high  loading  itself.  It  would  seem  natural  to  notice,  although  Van- 
noy  does  not,  apparently  because  he  did  not  attend  to  such  distinctions,  that 
the  PCT  is  the  only  test  in  the  battery  specifically  designed  for  measuring 
integrative  complexity. 

A much  better  study  is  that  of  Gardiner  (1968).  Gardiner's  greater 

sensitivity  to  "integrative  complexity,  " as  espoused  by  Schroder  and  Har- 
* 

vey,  leads  him  to  include  a total  of  six  measures  of  this  variable.  Scott's 
(1962)  sorting  task  is  included,  but  is  applied  to  nations  (as  Scott  used  it), 
so  its  failure  to  relate  to  other  tests  may  be  due  to  independence  of  com- 
plexity between  domains  as  much  as  to  independence  between  dimension- 


ality (if  sorting  measures  dimensionality)  and  integrative  complexity.  No 


true  measure  of  dimensionality  is  included  (e.  g.  Carr,  1965).  The  REP 
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appears  only  in  a check  list  "version.  " Also  included  in  the  battery  are  a 
number  of  variables  which  are  expected  on  theoretical  grounds  to  relate  to 
complexity.  In  most  cases,  these  derivations  are  quite  straightforward: 
generally,  the  theorists  themselves  have  stated  that  such-and-such  a char- 
acteristic is  related  to  complexity,  and  Gardiner  has  chosen  a commonly 
accepted  (or  acceptable)  measure  of  the  characteristic. 

The  major  findings  are,  first,  that  the  measures  of  integrative  com- 
plexity (the  TIB,  PCT  and  Interpersonal  Topical  Inventory,  or  ITI,  a 
forced-choice  objective  version  of  the  PCT,  two  scores  being  derived  from 
each  of  these  tests)  define  the  first  factor  in  a factor  analysis;  and,  second, 
that  the  third  factor,  which  Gardiner  pays  less  attention  to  than  the  second, 
appears  to  approximate  the  idea  of  a general  complexity  factor.  Tests 
loading  on  this  factor  are  the  PCT,  TIB,  Scott's  sorting  task,  and  Em- 
bedded Figures  Test,  plus  three  other  tests  requiring  flexible,  analytic 
thinking.  In  fact,  the  label  "Analytic  Thinking"  could  be  applied  to  the  fac- 
tor, an  interpretation  which  would  seem  consistent  with  Schroder's  (1969) 
description  of  complex  thinking.  Gardiner  suggests  that  the  factor  relates 
to  dimensionality  or  cognitive  differentiation,  noting  that  the  PCT  and  TIB 
relate  to  dimensionality,  while  Scott's  nation  sorting  task  is  "specifically 
dimensional."  In  that  case,  the  loadings  belie  a strictly  dimensional  inter- 
pretation, since  both  scoring  versions  of  the  TIB,  as  well  as  the  Embedded 
Figures,  load  more  highly  than  does  the  Scott  task. 

The  third  major  finding  is  that,  according  to  Gardiner's  original 
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factor  analysis,  a number  of  characteristics  which  have  been  hypothesized 
to  covary  with  complexity  do  not,  including  two  measures  of  flexibility. 
Associations  IV  and  Gestalt  Transformations;  two  tests  involving  acquies- 
cence, Social  Acquiescence  and  Seeing  Deficiencies;  a scale  of  Internal- 
External  Causation;  and  two  measures  based  on  tolerance  for  inconsistency 
in  attributed  descriptions  (references  for  these  tests  may  be  found  in 
Gardiner,  1968).  However,  there  is  some  question  as  to  the  appropriate- 
ness of  testing  such  hypotheses  with  factor  analysis.  The  virtue  of  factor 
analysis  is  that  it  compresses  or  organizes  large  quantities  of  data.  In 
the  process,  finer  details  may  be  lost.  (Additionally,  of  course,  there  is 
the  basic  arbitrariness  of  the  location  of  the  dimensions  and  thus  of  the 
factor  loadings.)  Thus,  whenever  possible,  hypotheses  should  be  tested 
without  first  subjecting  data  to  the  leveling  effect  of  factor  analysis. 

This  more  direct  type  of  testing  is  used  in  examining  three  hypothe- 
ses in  the  study;  (A)  that  there  will  be  no  sex  differences  in  complexity, 

(B)  that  firstborn  Ss  will  be  less  complex  than  laterborn,  and  (c)  that  in- 
discriminantly  pro-religious  Ss  will  be  less  complex  than  others.  Each 
hypothesis  as  stated  is  based  on  theories  about  complexity.  There  should 
be  no  sex  differences  because  no  one  has  ever  given  grounds  for  any,  or 
suggested  that  there  would  be.  Anticipated  differences  between  first  and 
laterborn  are  based  on  widespread  findings  (reported  in  Gardiner,  1968) 
that  firstborn  are  more  dependent,  acquiescent  and  conforming  and,  more 
importantly,  that  these  differences  are  based  on  childrearing  practices 
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which  differ  in  such  a way,  according  to  the  theorizing  of  Harvey  et  al. 
(1961),  as  to  justify  the  current  prediction,,  The  relationship  between  re- 
ligious conviction  and  complexity  is  based  largely  on  Harvey's  (1966)  state- 
ments and  scoring  criteria  for  the*  TIB.  Each  hypothesis  was  tested  by 
dividing  Ss  into  the  two  appropriate  groups  and  using  t tests  to  compare 
scores  on  the  other  variables. 

In  each  case,  the  hypothesis  was  not  supported.  Females  were  more 
complex  than  males.  Gardiner  explains  this  rather  plausibly  by  invoking 
some  clas s -related  differences  in  parental  treatment  of  boys  and  girls. 

The  failure  to  find  birth  order  effects  is  not  well  handled,  however.  Since 
he  argues  that  the  sex  differences  are  due  to  differences  in  child-rearing 
practices,  and  that  the  anticipated  birth  order  differences  would  have  been 
due  to  child-rearing  practices,  it  may  be  that  the  two  are  connected.  For 
example,  if  a disproportionate  number  of  firstborn  were  also  females,  the 
hypothesized  birth  order  effects  could  have  been  washed  out  by  the  hypo- 
thesized sex  effects. 

Finally,  indiscriminantly  pro-religious  (IPR)  Ss  were  found  to  be 
more  complex  than  other  Ss,  a result  opposite  that  expected.  Gardiner  at- 
tempts to  explain  this  first  by  stating  that  the  IPR  Ss  were  more  intrin- 
sically oriented  than  the  non-IPR  Ss  (a  "complex"  characteristic),  and, 
second,  by  pointing  out  that  the  Ss  were  young  (15-16  years  old)  and  thus 
not  yet  fully  differentiated  regarding  religion;  i.e.  that  the  IPR  group  in- 
cluded both  the  genuine  IPR  plus  intrinsically  religious  Ss  who  are  currently 
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also  global  but  will  not  be  in  a few  years.  While  this  is  not  unreasonable, 
it  also  avoids  some  issues.  Ss  are  all  of  an  age,  thus  differentiation  is 
relative.  One  would  still  expect  relatively  complex-  Ss  to  be  non-IPR.  At 
worst  they  might  be  unrelated,  if  religion  were  a distinct  domain  from  that 
in  which  complexity  is  measured.  In  the  present  case,  however,  IPR  Ss 
are  more  complex.  This  is  particularly  unexpected  in  light  of  the  fact  that 
the  measures  of  complexity  which  reach  significance  are  based  on  the  TIB 
scoring  criteria  which  state  that  IPR  behavior  represents  Level  I function- 
ing. It  would  seem  that  the  hypothesized  relationship  between  religious  be- 
lief and  integrative  complexity  is  not  present. 

On  this  basis,  then,  it  would  seem  that  the  PCT  and  the  TIB  are  re- 
lated, even  though  Gardiner  does  not  report  the  correlation  between  the 
two.  Unfortunately,  there  is  one  further  piece  of  evidence  which  contra- 
dicts this  finding.  Schroder  et_  al . (1967,  p.  115)  comment  virtually  en 
passant  that  the  two  tests  bear  only  a "low-order  relationship."  So  the 
picture  is  confused. 

Summary 

Both  Schroder  and  Harvey  share  the  position  that  integrative  com- 
plexity is  the  most  important  component  of  cognitive  complexity.  This 
makes  their  position  distinguishable  from  that  of  the  more  dimensionally 
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oriented  theorists  (Bieri,  1961;  Scott,  1969).  It  also  suggests  that  the  most 
immediate  question  is  whether  Schroder  and  Harvey  agree  on  their  mea- 
sures of  integrative  complexity,  rather  than  whether  integration  is  related 
to  dimensionality. 

Regarding  the  comparability  of  the  Schroder  and  Harvey  positions  and 
measures,  first,  there  is  evidence  that  in  factor  analyses  the  two  tests  con- 
stitute a separate,  common  factor:  Vannoy  (1964)  found  the  PCT  defining 

its  own  factor,  and  Gardiner  (1968)  showed  that  the  PCT  and  TIB  (plus  the 
ITT,  a derivative  of  the  PCT)  all  load  highly  on  a factor  by  themselves. 

The  correlation  between  the  measures  is  not  reported  in  the  Gardiner  study, 
however,  and  Schroder  et  al„  (1967,  p.  115)  have  stated  that  the  two  tests 
bear  ’'only  a lowT-order  relationship."  Thus,  the  evidence  is  conflicting. 

Second,  although  they  are  not  predicted,  sex  differences  in  complex- 
ity have  been  found  in  at  least  one  case  (Gardiner,  1968).  These  differ- 
ences were  attributed  to  differences  in  child-rearing  practices  for  the 
sexes,  but  it  is  not  impossible  that  other,  albeit  more  subtle,  sex-related 
differences  in  child-rearing  practices  and/or  cultural  role  expectations 
would  affect  complexity  at  other  socioeconomic  levels.  Thus,  the  question 
of  whether  there  are  sex  differences  in  complexity  is  not  settled. 

There  is  another  area  of  ambiguity  which  will  be  briefly  mentioned 
here.  Schroder  et  al.  (1967,  p.  197)  also  state  that  the  correlations  be- 
tween PCT  scores  and  intelligence  have  ranged  between  . 12  and  .45  for 
various  samples.  This  may  remind  us  that  the  second  highest  loading  on 
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Yannoy's  "PCT  factor”  was  SCAT -Q,  and  perhaps  also  of  the  analytic  cast 
to  Gardiner’s  (1968)  third  "general  complexity”  factor.  (One  of  Gardiner's 
greatest  deficiencies  was  his  failure  to  include  an  intelligence  measure.) 
The  general  question  is,  then,  what  is  the  relationship  between  intelligence 
and  integrative  complexity? 

A somewhat  related  question  is  that  of  sample  characteristics.  Not 
only  are  Ss  typically  college  students,  but  college  students  from  fairly  un- 
representative schools.  The  potential  effects  of  these  biases  have  appar- 
ently never  been  fully  considered. 

Before  further  developing  these  points,  some  background  discussion 


of  theory  seems  appropriate. 


Chapter  II 

CONCEPTUAL  SYSTEMS:  THEORY 


The  fullest  description  of  this  theoretical  position  is  found  in  Harvey, 
Hunt  and  Schroder's  (1961)  book.  The  basic  ideas  from  this  source  appear 

I 

in  briefer,  more  readable  form  in  Harvey  and  Schroder  (1963),  Harvey 

I 

(1966),  and  to  some  degree  in  Schroder  and  Harvey  (1963).  In  general,  it 

j 

* i 

appears  that  Harvey  continues  to  espouse  an  essentially  unchanged  version 

of  the  1961  theory,  whereas  Schroder  harbors  some  revisionist  tendencies, 

I 

hints  of  which  may  be  noted  in  the  Schroder  and  Harvey  (1963)  paper. 

i 

' 


The  Theory  of  Conceptual  Systems: 

Harvey,  Hunt  and  Schroder 

Conceptual  systems  are  composed  of  concepts  and  their  interrelations. 

! 

r 

The  definition  of  "concept"  varies  somewhat  from  place  to  place,  at  one 

i 

point  being  defined  so  as  to  imply  a dimensional  characteristic  (".  . .a 
system  of  ordering  that  serves  as  the  mediating  link  between  the  input  side 
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[stimulus]  and  the  output  side  [response]"  p„  1 ),  in  another  case  suggest- 
ing that  concepts  are  points  along  dimensions  (concepts  represent  "a  cate- 
gory of  varying  definitiveness  and  breadth  along  some  specifiable  dimen- 
sion. o . p.  10),  and  in  still  an'other  situation  saying  that  a concept  is 
minimally  a comparison  of  two  points  on  a dimension,  and  may  be  a point 
or  points  in  a space  of  infinite  dimensionality  ("The  simplest  concept  is  one 
involving  two  points  on  a single  dimension,  and  the  most  complex  concept 
is  one  embodying  an  infinity  of  points  on  an  infinity  of  dimensions."  p,  62). 
These  differences  are  probably  not  critical,  although  the  last  seems  differ- 
ent as  well  as  overdrawn.  The  basic  idea  is  apparently  quite  similar  to 
George  Kelly's  (Kelly,  1955);  namely,  that  organization  is  imposed  on  per- 
ceptual experience,  that  "meaning"  (or  "understanding")  derives  from  the 
imposed  organization,  and  that  this  organization  is  in  varying  degrees  idio- 
syncratic and  arbitrary.  Concepts  ("constructs"  and  "contrasts,  " in 
Kelly's  terminology)  are  the  terms  in  which  the  world  is  viewed;  that  is, 
concepts  are  the  content  of  conceptual  systems.  Harvey,  Hunt  and  Schroder 
wish  to  regard  differentiation-integration  as  a process,  and  concreteness - 
abstraction  as  a result  of  the  process.  Consequently,  neither  of  these 
dimensions  is  inherent  in  the  stimuli,  but  rather  in  the  conceptualization  of 
the  stimuli. 

An  increasing  number  of  concepts  implies  increasing  differentiation 
"'‘In  this  section,  page  numbers  refer  to  Harvey,  Hunt  and  Schroder, 


1961. 
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on  the  part  of  the  person,  but  greater  abstractness  is  not  an  automatic  con- 
comitant  of  greater  differentiation.  True  abstractness  requires  integra- 
tion of  concepts  as  well  as  differentiation,  and  differentiation  may  occur 
without  subsequent  integration.  Also,  although  differentiation  is,  in  some 
sense,  a logical  prerequisite  to  integration,  these  authors  argue  that  the 
two  can  occur  essentially  simultaneously.  Unfortunately,  important  as  it 
clearly  is  to  the  theory,  the  nature  of  integration  is  never  really  made  clear. 
"Integration  is  the  relating  or  hooking  of  . . . parts  to  each  other  and  to 
previous  conceptual  standards"  (p„  18).  A straightforward  interpretation 
would  be  that  differentiation  involves  seeing  differences  and  integration  in- 
volves seeing  similarities,  both  of  which  can  be  along  dimensions.  How- 
ever, the  book  consistently  suggests,  and  there  are  explicit  statements 
elsewhere  (e.  g.  Schroder  et  al „ . 1967,  p.  7),  that  dimensions  and  integra- 
tive rules  are  qualitatively  different.  This  failure  to  clarify  the  integration 
concept  is  a near  fatal  weakness  in  the  theory,  and  one  which  persists  in 
the  current  positions. 

Given  proper  parental  training  practices  (to  be  discussed  below), 
children  will  progress  from  concrete  to  abstract  functioning.  If  conditions 
are  unfavorable,  arrestation  will  occur  at  some  point  along  the  dimension, 
the  stopping  point  being  determined  by  the  nature  of  the  unfavorableness. 

Of  course,  the  level  to  which  a person  progresses  is  likely  to  vary  from  one 
content  domain  to  another,  again  as  a function  of  varying  training  conditions. 
Harvey,  Hunt  and  Schroder  have  concentrated  on  the  interpersonal  domain, 


20 


which  is  appropriate,  but  how  and  the  degree  to  which  their  discussion 
would  generalize  to  a more  content-oriented  domain  (e.  g„  "history")  is  not 
always  clear. 

In  the  theory,  introduction  into  a new,  unfamiliar  realm  of  experi- 
ence necessitates  relatively  concrete  behavior,  i.e.  dependency  on  external 
sources  or  authority.  With  increasing  familiarity  the  potential  for  increas- 
ingly abstract  behavior  develops.  The  authors  cite  a number  of  develop- 
mental theorists  in  support  of  their  argument  that  children  typically  pass 
through  several  concrete  to  abstract  cycles,  the  concrete  phases  corre- 
sponding to  the  times  at  which  children  begin  to  talk  (roughly  age  2),  begin 
social  behavior  (age  5),  enter  peer -centered  society  (age  10),  and  begin  to 
take  on  adult  roles  (apparently  seen  as  about  age  17).  The  first  cycle  pro- 
gresses in  the  direction  of  abstraction  only  as  far  as  the  second  stage  or 
level,  or  perhaps  to  the  transition  between  the  second  and  third  levels. 
Later  cycles  progress  as  far  as  Level  IV. 

The  first  stage  (Level  I)  is  labeled  "unilateral  dependence."  It  might 
also  be  termed  a "pro-authoritarian  stage":  the  connotative  baggage  seems 

to  be  the  same  (provided  emphasis  is  on  style  or  process,  not  content).  If 
differentiation  and  integration  occur  to  a proper  degree,  Level  II  will  be 
reached,  ^ which  is  described  as  a period  of  negative  independence,  and  in 


Much  of  the  elaborateness  of  the  book,  which  is  considerable,  is  due 
to  extensive  consideration  not  only  of  the  major  stages  but  also  of  the  trans- 
itions between  stages.  This  material  seems  to  have  dropped  out  later  and 
will  not  be  considered  here. 
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at  least  some  forms  bears  a strong  resemblance  to  anti-authoritarianism. 
The  categoricalness  is  similar  to  Stage  I.  The  distinction  between  the 
levels  is  on  the  basis  that  at  Level  II  the  person  is  differentiating  himself 
from  the  external  authority,  whereas  at  Level  I he  is  defining  himself  in 
terms  of  it.  Exploring  this  differentiation  takes  the  form  of  resistance  to 
the  authority.  Integration  of  the  two  possibilities,  necessary  for  progress, 
would  make  available  to  the  person  some  form  of  both  attitudes  and  both 
types  of  behavior.  Fixation  at  this  level,  a failure  to  integrate,  would  leave 
only  a strong  negativism. 

Level  III  is  characterized  by  conditional  dependence  and  mutuality. 

In  this  stage,  hypothetical  or  conditional  ("as -if")  thinking  becomes  pos- 
sible, and  a more  objective,  information-oriented  attitude  is  taken  toward 
the  world.  Socially  the  person  at  this  level  of  abstraction  is  able  to  see 
another's  point  of  view  and  to  understand  the  other  person  from  that  point 
of  view.  Additionally,  the  third  stage  permits  the  "holding  of  alternative 
views  of  the  self,  of  events,  and  of  others  simultaneously  with  a minimum 
of  concern  for  ambiguity"  (p.  101).  By  implication  there  is  still  an  element 
of  dependency  and/or  a diffuseness  in  the  self-concept,  because  Level  IV  is 
defined  as  the  stage  at  which  "...  mutuality  and  autonomy  are  integrated 
so  that  neither  interferes  with  the  other  and  yet  both  are  important"  (p.  106). 
In  science  and  other  noninterpersonal  endeavors,  Level  III  appears  as  an 
empirical  orientation;  so  presumably  IV  is  the  theoretical  stage,  although 
the  analogy  with  mutuality  and  autonomy  does  not  appear  to  be  strong. 
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Level  IV  functioning  is  characterized  by  an  increased  sense  of  internal 
causality  as  well  as  increasing  problem-solving  ability,  tolerance  of  anxi- 
ety, ambiguity  and  inconsistency,  greater  resistance  to  stress,  and  a 
stronger  orientation  toward  informational  solutions  to  questions  or  prob- 
lems . 

If  training  conditions  are  unfavorable,  then  development  will  be  ar- 
rested at  some  point  along  the  scale,  the  point  of  arrestation  being  deter- 
mined by  the  nature  of  the  training  practices.  Again,  a super -articulated 
theory  will  be  greatly  simplified,  for  two  reasons:  to  conserve  space,  and 

because  the  theorists  themselves  have  apparently  paid  no  serious  attention 
to  the  elaborations  in  their  own  research. 

Training  is  of  two  major  types,  unilateral  and  interdependent.  Each 
of  these,  in  turn,  has  two  subtypes.  Thus,  the  four  major  types  of  train- 
ing are  (1)  reliable  unilateral,  (2)  unreliable  unilateral,  (3)  protective  inter 
dependent,  or  (4)  informational  interdependent.  These  dispose  toward 
arrestation  of  the  trainee  at  Levels  I through  IV,  respectively. 

Reliable  unilateral  training  implies  an  authoritarian  orientation  in 
the  trainer.  That  is,  the  criteria  for  rewards  and  punishments  are  exter- 
nal, "objective,  " rigid,  evaluative  and  absolute.  Good  behavior,  as  de- 
fined by  the  criteria,  is  consistently  rewarded,  and  bad  is  consistently 
punished.  There  are  few,  if  any,  lapses  in  either  direction.  Questioning 
of  the  criteria  is  not  permitted.  Requirements,  however,  must,  in  some 
sense,  be  within  the  capacity  of  the  trainee.  Training  of  this  sort  is  hypo- 
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thesized  to  lead  to  Level  I functioning. 

Unreliable  unilateral  training  (defined  from  the  trainee's  point  of 

I 

view)  occurs  primarily  either  when  the  trainee  is  unable  to  discover  the 
external  criteria  (e.  g„  when  feedback  is  too  erratic)  or  when  he  is  unable 

I 

to  conform  to  it  (as  when  the  demands  are  beyond  his  ability).  In  these  in- 
stances, the  trainee  either  sees  no  pattern  of  reward  and  punishment,  or 
builds  up  an  expectation  of  failure,  A third  condition  which  may  be  experi- 
enced as  unreliable  occurs  if  parents  are  indifferent  toward  and  neglectful 
of  their  children.  Fixation  at  Level  JJ  is  likely  to  follow  from  any  of  these 
circumstances . 

Protective  interdependent  training  is  based  on  systematic  rewards 
for  instrumental  behavior,  where  the  goal  of  the  behavior  is  not  evaluated 
against  external  criteria  as  in  the  reliable  unilateral  condition.  In  this 
case,  support  is  used  both  as  a reward  and  as  a means  of  directing  behav- 
ior.  This  support  is  likely  to  be  so  extreme  as  to  become  a tendency  to 
protect  the  trainee,  insofar  as  the  agent  is  capable,  from  failure  experi- 
ences, Thus,  although  relativism  and  an  emphasis  on  interaction  exist, 
there  is  also  a restriction  as  to  the  kinds  of  experiences  the  trainee  is  per- 
mitted, and  progress  beyond  Level  III  will  probably  not  occur. 

Under  informational  interdependent  training,  the  trainee  is  permitted 

i 

to  learn  from  the  consequences  of  his  own  experiences,  including  failure, 
with  the  important  proviso  that  the  environment  be  so  structured  that  the 
subject  is  not  pushed  beyond  his  capacities.  So  long  as  this  is  the  case, 


24 


the  trainer  feels,  protection  is  unnecessary  and  failure  is  as  informative 
as  success.  It  is  this  informational  aspect  of  experience  which  is  empha- 
sized and  valued  by  the  trainer.  Such  an  approach  permits  the  development 

j 

of  the  abstract  skills  required  for  Level  IV  functioning. 

For  present  purposes,  this  is  an  adequate  summary  of  the  1961  posi- 

i 

tion.  Modifications  in  the  theory  will  be  considered  below;  however,  it 
should  be  emphasized  that  what  Harvey,  Hunt  and  Schroder  set  out  to  do  was 

i 

to  describe  a cognitive  variable  which  is  independent  of  content  but  which 

i 

would  predict  certain  styles  of  behavior.  Seemingly,  they  have  been  suc- 
cessful at  a descriptive  level  at  least.  For  example,  theoretically  the 

i 

authoritarianism  of  Level  I may  be  of  the  left  or  right,  pro-  or  anti- 
religious,  behavioristic,  humanistic,  etc.,  and  supposedly  the  same  is 
true  for  each  of  the  other  levels.  As  we  will  see,  Harvey  has  moved  to- 
ward  a closer  relationship  with  content,  and  Schroder  has  moved  further 

i 

i 

awayc 

Before  discussing  such  changes,  however,  a problem  inherent  in  the 

| 

development  and  training  arguments  should  be  mentioned.  Basic  to  the 

! 

problem  is  the  fact  that  by  either  measurement  technique  (to  be  discussed 
below)  complex  Ss  will  comprise  only  4%  to  10%  of  a sample  ("complex"  has 
different  meanings --Level  IV  for  Harvey  and  approximately  a quarter  of 
the  way  between  II  and  III  for  Schroder).  That  is,  in  young  adult  (college) 

i 

j 

populations,  complex  people  are  quite  rare.  This  would  seem  consistent 
with  the  description  of  the  training  agency  hypothetically  required  to  pro- 

i 

i 
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duce  abstract  thinking.  It  is  not  consistent  with  the  discussion  of  develop- 
mental cycles,  in  which  it  is  stated  that  the  modal  (i.e.  most  common)  be- 
havior for  nine -year -olds  is  at  the  level  of  transition  into  the  fourth  stage, 
and  that  Level  IV  functioning  "is  rriore  likely  in  the  adolescent  cycle  of 
development.  " That  is,  at  age  9 or  10  the  most  common  functioning  is  very 
nearly  the  most  abstract  possible,  but  by  age  20  this  same  abstract  behav- 
ior has  become  the  least  common  found.  My  suspicion  is  that  the  theorists 
were  carried  away  in  their  desire  to  integrate  a variety  of  source  materi- 
als, although  there  is  at  least  one  alternative  explanation;  namely,  that 
parents  are  relatively  interdependent  and  schools  relatively  unilateral 
trainers,  and  that  by  college  the  increased  proportion  of  time  spent  in 
school,  plus  selection,  have  done  their  job.  This  hypothesis  seems  un- 
likely, however. 

Conceptual  Systems:  Theoretical  Modifications 

No  significant  theoretical  revisions  are  apparent  in  Harvey's  more 
recent  writings.  The  most  recent  exposition  of  which  I am  aware  (for  ex- 
ample, Harvey,  1966)  is  a concise,  convenient  summary  heavily  based  on 
Harvey  et  al.  (1961).  (The  tendency  toward  closer  ties  to  content  will  be 
discussed  in  conjunction  with  measurement.)  There  is  reduced  confusion 
over  the  term  "integration,  " but  this  results  not  from  elaboration  of  a con- 
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cept  which  is  different  from  dimensionality,  but  from  a simplification  of 
the  concept  until  it  becomes  itself  essentially  dimensional.  Integration  is 
said  to  be  "interrelating,  " which  can  easily  be  given  a dimensional  inter- 
pretation.  This,  in  turn,  means  that  there  has  become  little  distinction  be- 
tween Harvey's  theoretical  position  and  a strictly  dimensional  definition  of 
complexity  (e.g.  Bieri,  1966). 

Schroder  (1969)  stands  in  some  contrast  to  Harvey,  since  a number 
of  changes  have  been  introduced  into  his  theorizing,  most  of  them  having  to 
do  with  the  nature  of  the  concrete -abstract  dimension. 

First,  specification  of  the  dimension  has  even  less  to  do  with  content 
or  with  behavioral  styles  than  in  earlier  material.  The  dimension  is  con- 
ceived in  more  truly  structural  terms.  Schroder  no  longer  specifies  the 
nature  of  the  poles  of  the  developing  dimensions,  and  the  specification  of 
the  behavioral  characteristics  is  more  obviously  derived  from  the  structure 
of  the  cognition  as  opposed  to  the  nature  of  the  dimensional  poles.  Further, 
the  model  is  far  more  easily  generalizable  to  domains  other  than  inter- 
personal, and  the  progression  through  the  stages  is  more  logical  and  order- 
iy.  and  more  easily  seen  as  developmental.  All  of  this  has  strengthened 
the  model  enormously. 

Second,  the  ideas  of  differentiation  and  integration  have  been  ampli- 
fied, and  in  the  process  definitions  have  changed.  Finally,  there  is  a 
strong  tendency  toward  specification  of  three  instead  of  four  levels.  This 
is  probably  for  practical  reasons,  as  virtually  no  one  operates  even  at 


. 
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Level  III  on  the  device  used  by  Schroder  to  measure  conceptual  level. 

More  concretely  (if  that  can  be  permitted)  and  briefly,  Schroder's 
current  position  (Schroder,  1969)  is  as  follows:  Differentiation  is  now  de- 

fined as  "the  number  of  categories  or  kinds  of  information  extracted  from 
a given  domain  of  stimuli,  " or  the  number  of  unique  principles  of  stimulus 
ordering,  or  the  number  of  scales  or  dimensions  used  by  the  person  in 
viewing  stimuli.  These  scale  or  dimensional  values  are  the  informational 
input,  and  for  a given  stimulus  the  amount  of  information  will  vary  from 
person  to  person  with  the  degree  of  differentiation.  It  is  hypothesized  that 
people  learn  to  combine  these  values  in  various  ways  according  to  learned 
rules.  The  integrative  complexity  of  the  person's  conceptual  structure  is 
determined  by  the  number  of  these  combinatory  rules  and  by  the  "degree  of 
connectedness"  between  the  rules.  A combinatory  rule  is  a pattern  of 
weights  assigned  to  the  scale  values,  so  if  stimuli  are  scaled  along  three 
dimensions,  and  the  values  along  the  scales  are  weighed  50,  30,  and  20, 
then  the  resultant  values  have  been  arrived  at  by  a combinatory  rule  (alter- 
nately termed  a "perspective").  Level  I functioning  occurs  when  only  one 
such  combinatory  rule,  or  set  of  weights,  is  available  for  a given  set  of 
scales.  In  simplest  terms,  this  means  that  a concrete  person  is  one  who 
can  see  only  one  alternative  or  one  conclusion  in  a particular  situation. 
When  two  or  more  alternatives,  patterns  of  weights,  understandings,  per- 
spectives, interpretations,  judgments,  or  whatever,  are  available,  simul- 
taneously and  for  the  same  information,  then  the  person  is  operating  at 
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Level  II.  C onnectedness  between  these  two  or  more  perspectives  plays  the 
same  role  as  integration  or  integrative  rules  in  Harvey,  Hunt  and  Schroder 
(1961),  and  is  the  necessary  condition  for  Level  III 'functioning,  the  highest 
Schroder  considers„  "Connectedness"  is  every  bit  as  poorly  defined  as 
"integrative  rules,  " which  is  the  most  obvious  and  immediate  criticism  of 
the  theory.  To  connect  perspectives  is  to  relate  perspectives,  and  beyond 
this  the  concept  is  simply  not  developed.  My  best  guess  is  that  connecting 
perspectives  means  to  compare  them  against  some  superordinate,  relative 
construct,  like  goals,  values,  or  motives. 

In  spite  of  the  terrific  flaw  resulting  from  Schroder's  failure  or  in- 
ability to  specify  the  nature  of  connectedness  more  fully,  I find  his  posi- 
tion intuitively  and  aesthetically  more  attractive  than  Harvey's.  This  is 
not  to  say  that  there  are  not  other  problems,  or  that  there  are  not  viable 
alternatives . 

It  seems  clearly  to  be  the  case  that  perspectives,  the  results  of  com- 
binatorial rules,  may  be  regarded  as  dimensions.  Each  perspective  orders 
stimuli  in  a new  way,  which  is  the  definition  of  a dimension.  All  that  is 
necessary  to  locate  the  new  dimension/per spectives  is  to  find  an  axis  such 
that  the  cosine  of  its  angle  with  each  of  the  dimensional  axes  is  equal  to  the 
weight  given  ratings  on  that  dimension,  expressed  as  a decimal  fraction. 
There  may  be  cases  where  this  would  not  be  possible,  but,  in  general,  I 
suspect  it  could  be  managed. 

I can  imagine  two  objections  Schroder  might  raise.  First,  the  new 
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factor  would  be  oblique,  and  he  specifies  that  informational  dimensions  be 

mutually  orthogonal.  The  urgency  of  this  requirement  is  not  clear  to  me, 

■ 

inasmuch  as  orthogonal  dimensions  are  probably  rare  at  best,  and  may  be 

j 

psychometric  inventions.  Oblique  relationships  are  probably,  at  the  very 
least,  more  common.  (Schroder  uses  "friendliness”  as  an  example  of  a 

t 

dimension:  to  what  is  the  friendliness  dimension  orthogonal?)  It  is  not 

I 

I 

j 

even  clear  that  research  reported  later  in  the  chapter  conforms  to  this  re- 
quiremento  In  one  study,  stimulus  persons  were  described  along  three 
dimensions:  creativity,  orderliness,  and  reciprocity,  which  Schroder  re- 

ports  as  being  independent.  The  sense  in  which  they  are  independent  is  not 
clear  (statistically  or  cognitively),  and  it  may  well  be  that  at  least  some  Ss 
would  feel  moderate  dissonance  if  a- target  were  described  as  very  creative 
and  also  very  orderly. 

I 

The  second  objection  to  viewing  perspectives  as  dimensions  might  be 
based  on  the  statement  that  differentiation  is  the  "kinds  of  information  ex- 
tracted  from  a given  domain  of  stimuli."  The  implication  is  that  informa- 

| 

tion  is  somehow  intrinsic  in  the  stimuli,  whereas  perspectives  are  the  re- 
sult of  cognitive  processing.  I regard  this  as  an  untenable  distinction. 

; 

Examples  of  dimensions  used  by  Schroder  have  been  mentioned  above 

, 

(friendliness,  creativity,  orderliness,  reciprocity).  The  only  case  in  which 

i 

| 

these  are  "given11  occurs  when  an  experimenter  tells  a subject  that  a stim- 

- 

ulus  person  is  "friendly,  " etc.,  and  even  in  this  situation  the  connotations 
of  this  information  will  vary  among  £s . Normally,  making  judgments  along 
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dimensions  will  involve  processes  like  those  involved  in  perspectives.  The 
differences,  if  any,  will  have  to  do  with  the  ease  and/or  speed  with  which 
the  process  operates.  In  general,  perspectives  and  dimensions  seem  com- 
parable . 

Because  the  nature  of  connectedness  is  so  unclear,  it  is  more  diffi- 
cult to  attempt  a dimensional  reinterpretation.  However,  if  "connected- 
ness" means  ’’relating,  ” this  suggests  to  me  the  finding  of  similarities  and 
differences,  which,  in  turn,  suggests  dimensions  or  concepts  (equivalent 
in  Schroder's  terminology).  If  connectedness  is  in  terms  of  goals,  values 
or  motives  (one's  own  or  others,  simultaneously  or  one  at  a time),  I sus- 
pect these  could  be  represented  as  idealized  dimensions  or  points  in  the 
space,  against  which  comparisons  could  be  made. 

This  predilection  toward  a dimensional  model  demonstrated  above 
has  two  bases;  first,  a reaction  against  the  Schroder  and  Harvey  claim  that 
they  are  dealing  in  concepts  which  dimensional  approaches  cannot  explain, 
especially  since  the  integration  concept  is  never  specified,  and,  second, 
an  interest  on  my  part  in  pushing  the  dimensional  model  to  its  limits  before 
introducing  additional  constructs.  However,  even  if  this  dimensional  re- 
interpretation of  conceptual  levels  is  plausible,  Schroder  is  raising  an  im- 
portant point,  and  one  which  a dimensional  approach  may  have  trouble  re- 
presenting; namely,  the  issue  of  super-  and  subordinate  relationships 
among  dimensions  or  perspectives.  Kelly  (1955)  identified  two  types  of 
such  hierarchical  arrangements,  one  in  which  the  dimensions  lie,  as  it 
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were,  parallel  to  each  other  (e.g.  "intelligent -stupid"  is  in  some  degree 
subordinate  to  "good-bad"),  the  other  in  which  both  poles  of  one  dimension 
are  subordinate  to  one  pole  of  the  superordinate  dimension  (Kelly's  ex- 
ample: "light-dark"  are  both  desc-riptive  on  a "descriptive -evaluative" 

dimension)  o These  examples  could  probably  be  handled  by  leaving  empty 
quadrants  in  the  space,  but  this  seems  inelegant  and  the  general  issue  could 
quite  possibly  prove  thorny. 

Schroder's  connective  rules,  then,  would  probably  have  to  be  viewed 
as  superordinate  dimensions  if  they  are  to  be  dimensions  at  all.  Abstract 
functioning,  in  Schroder's  sense,  would  be  dependent  upon  the  development 
and  use  of  such  superordinate  dimensions.  Conceptual  systems  theorists 
have  argued  that  abstraction  is  not  related  to  dimensionality.  This  would 
be  an  agreeable  statement  so  long  as  the  dimensions  considered  are  at  a 
given  ordinal  level.  In  passing  it  should  be  noted  that  the  REP  test  (Kelly, 
1955),  commonly  used  to  measure  a dimensional  form  of  cognitive  complex- 
ity (e.g.  Bieri,  1966),  is  not  used  to  distinguish  super-  and  subordination. 
Some  dimensional  measure  is  needed  which  will  reveal  such  structure  be- 
fore it  can  be  determined  whether  the  relationship  proposed  above  exists. 

A comparison  of  the  current  theoretical  positions  of  Schroder  and 
Harvey  will  be  delayed  until  their  respective  measures  have  been  consid- 


Chapter  III 


CONCEPTUAL  SYSTEMS:  OPERATIONS 


Although  a number  of  procedures  are  viewed  as  yielding  scores  re- 

i 

levant  to  structural  complexity,  in  practice  both  Schroder  and  Harvey  have 

j 

evolved  particular  tests  which  they  regard  as  criterion  measures.  The  two 

i 

j 

tests  are  both  similar  and  different  and  will  be  discussed  in  this  chapter. 


Schroder's  Paragraph  Completion  Test 
(Schroder,  Driver  & Streufert,  1967) 


The  Paragraph  Completion  Test  (PCT,  Appendix  I)  presents  the  sub- 
ject  with  a word  or  phrase  which  is  supposed  to  be  the  beginning  of  a sen- 
tence, and  requests  S to  finish  the  sentence  and  write  two  or  three  further 
sentences  on  the  same  topic.  There  are  five  such  items,  plus  a warm-up, 
nonscored  item.  Each  item  is  timed,  the  limit  ranging  from  100  to  130 


seconds 


Items  are  rated  by  trained  judges,  scores  being  given  on  a 7 -point 
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scale,  the  odd  numbers  representing  the  four  major  levels.  The  mean  (or 
sum)  of  the  two  highest  item  scores  is  the  preferred  overall  score.  Inter- 
rater reliabilities  for  this  score  reportedly  range  from  .80  to  .95,  with 
inter-item  correlations  mostly  in  the  .40s,  ranging  between  . 39  and  . 62. 

No  test-retest  reliabilities  are  given. 

Scoring  criteria  are  as  follows  (based  on  Schroder  et  al.,  1967, 
Appendix  II):  A score  of  1 ("Low  Integration  Index"  or  Level  I)  is  given  for 
a response  generated  by  a single  rule,  where  alternatives  are  neither  pre- 
sent nor  considered,  and  where  thinking  is  categorical  and  without  condi- 
tions. Additionally,  (a)  conflict  is  viewed  as  bad,  and  uncertainty  or  ambi- 
guity as  indicating  weakness;  (b)  S seeks  fast,  unambiguous  closure  via  in- 
ternally consistent,  balanced  processes;  (c)  a specific  rule  or  guide  is  of- 
fered which  leads  to  or  implies  an  absolute,  categorical  solution;  (d)  S 
compartmentalizes  or  categorizes  facts  and  effects  and  tends  to  treat  only 
one  side  of  an  issue. 

Score  2:  An  absolute  score  is  qualified,  but  no  clear  alternative  is 

given. 


i 

i 


Score  3:  (Medium-Low  Integration  Index,  Level  II):  Two  or  more 

"alternative  rule  structures"  (or  perspectives  or  interpretations)  for  view- 
ing'the  event  are  simultaneously  present,  plus  possibly  a conditional  rule 
for  selecting  among  the  alternatives.  Indicators  include  (a)  mentioning  two 
or  more  different  interpretations  of  events;  (b)  listing  similarities  or  dif- 
ferences without  considering  relationships;  (c)  giving  conditional  rules  for 


34 


choosing  among  ways  of  categorizing;  (d)  reacting  against  absolutism  and/ 
or  avoiding  dependence  on  external  forces. 

Score  4;  Alternatives  are  offered,  with  the  implication  of  interaction, 
but  the  "interaction  is  expressed  as  a qualification  rather  than  as  the  emer- 
gence of  comparison  rules." 

Score  5;  (Medium-High  Integration  Index,  Level  III):  Alternative  in- 
terpretations are  given  and  interact  in  some  specified  way.  More  specifi- 
cally, (a)  the  alternatives  are  related  in  a way  which  maintains  the  ambi- 
guity or  conflict;  (b)  the  existence  of  the  alternatives  is  itself  regarded  as 
having  various  meanings;  (c)  other  people's  points  of  view  are  taken  into 
consideration;  (d)  indication  that  one's  behavior  is  influenced  by  others'  be- 
havior and/or  that  social  interactions  should  be  based  on  mutual  responsi- 
bility; (e)  seeing  alternative  reasons  for  similarities  and  differences  in  per- 
spectives. 

Criteria  for  scores  of  6 and  7 (7  is  the  High  Integration  Index,  Level 
IV)  are  described,  but  such  responses  are  exceedingly  rare  and  will  not  be 
considered  here. 

It  should  not  be  difficult  to  achieve  the  interrater  reliabilities 
Schroder  reports.  The  criteria  for  scores  above  5 are  extremely  abstract, 
but  since  such  responses  are  rarely  encountered  there  is  no  great  problem. 
One  difficulty  would  be  in  discriminating  between  "polarity  responses" 

(Score  1)  and  true  alternatives  (Score  3).  The  former  are  alternative  de- 
cisions based  on  a single  rule  (e.g.  "good  or  bad"),  whereas  the  latter  are 
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based  on  different  rules  (something  might  be  good  and  bad  simultaneously). 
Once  these  can  be  separated,  rater  reliability  should  not  be  hard  to  estab- 
lish. 

There  is  some  potential  confusion  about  the  distribution  of  scores, 
however.  Schroder  et  ad . (1967,  p.  195)  indicate  that,  for  the  most  con- 
crete and  most  abstract  of  their  samples  to  date,  approximately  4%  and 
12%  of  the  Ss  (respectively)  obtained  scores  of  7 on  a 7 -point  scale,  i.e. 
obtained  the  highest  possible  score.  At  that  time  a S's  score  was  either 
the  mean  of  all  his  item  scores  or  the  mean  of  the  two  highest  item  scores. 
Current  practice  is  to  add  the  two  highest  item  scores  together  to  decide 
a score,  so  these  are  distributed  between  2 and  14  (a  13-point  scale).  The 
confusion  lies  in  the  fact  that  the  scores  on  this  13 -point  scale  are  seldom 
higher  than  7 and  that  this  distribution  of  scores  on  the  lower  half  of  the 
13 -point  scale  (the  lower  half  of  the  complexity  dimension)  may  be  mis- 
taken for  the  distribution  reported  in  Schroder  et  al.  for  the  entire  7 -point 
scale  (the  whole  dimension). 

Schroder  explains  this  shift  in  distributions  ^ by  stating  that  there  has 
been  a change  in  scoring  procedures  (the  changes  appear  to  be  in  the  appli- 
cation of  the  scoring  criteria,  not  in  the  criteria  per  se),  so  that  now  raters 
are  being  especially  careful  to  avoid  making  liberal  assumptions  about  or 
interpretations  of  responses,  since  these  were  leading  to  unreliable  dis- 

*H.  M.  Schroder,  personal  communication. 
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tinctions  in  the  upper  range  of  scores,, 

The  implications  of  the  shift  are  unclear.  The  ’’new"  distribution  is 
more  nearly  normal  than  the  "old"  one,  i.e.  it  is  sharply  peaked  in  the 
center  and  has  few  scores  at  eithe'r  extreme.  The  old  distribution  was 
much  flatter,  and  in  some  samples,  the  mode  was  the  lowest,  most  con- 
crete score.  The  new  scores  are  obviously  not  merely  a scaling-down  of 
the  old  distribution.  Thus,  the  relationship  between  complex  Ss  as  defined 
by  Schroder  et_  aL  (1967)  and  as  they  are  used  today  is  not  certain.  It  is 
not  clear  what  score  was  required  to  constitute  complexity  in  Schroder  et  al. 
but,  by  present  standards,  a complex  S is  one  for  whom  the  sum  of  his  two 
highest  item  scores  is  at  least  7 (3.5  on  the  7-point  scale);  that  is,  very 
slightly  above  Schroder's  "Medium-Low  Integration  Index,  " or  very  slightly 
above  Level  II  in  Harvey  (1966)  or  Harvey,  Hunt  and  Schroder  (1961)  termi- 
nology (ignoring  theoretical  differences).  Put  differently,  a complex  per- 
son, by  Schroder's  current  definition,  is  one  who  sees  alternative  interpre- 
tations to  events,  and  who  additionally  shows  the  tiniest  wisp  of  ability  to 
integrate  interpretations.  It  seems  likely  that,  if  anything,  complexity  so 
defined  is  even  less  common  than  complexity  as  defined  by  Schroder  et  al. 
(1967). 

Obviously  there  is  nothing  wrong  with  being  interested  in  uncommon 
characteristics,  but  such  an  interest  does  introduce  certain  problems.  In 
the  present  case,  the  theory  becomes  virtually  inapplicable  to  naturally 
occurring  groups  (a  problem  exacerbated  by  the  restricted  variance  of  the 
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scale  and  by  Schroder’s  habit  of  using  extreme  samples  and  ignoring  the 
middle  group).  Further,  the  reliability  of  the  test  becomes  extremely  im- 
portant. If  the  test  misidentifies  subjects,  considerable  mischief  can  be 
done.  Given  this  fact,  it  is  especially  odd  that  Schroder  does  not  report 
test-retest  reliabilities. 

Insofar  as  validity  is  concerned,  the  test  does  appear  to  make  an  im- 
portant distinction  among  Ss  regarding  their  information-processing  and 
decision-making  capacities.  Schroder's  typical  procedure  is  to  test  a 
large  enough  pool  of  Ss  to  obtain  a given  number  of  "complex"  persons. 
"Simple"  Ss  are  then  chosen  who  match  the  complex  Ss  on  important  control 
variables  (intelligence,  in  particular,  but  sometimes  other  variables  also). 
Groups  of  Ss  so  defined  differ  in  the  amount  of  information  they  are  able  to 
utilize  in  decision-making,  and  in  the  quality  of  the  decisions  made,  the 
complex  groups  being  more  coordinated,  interrelated  and  self-directed. 

The  tasks  to  which  the  groups  apply  themselves  vary,  but  are  character- 
ized as  being  long  term  (usually  several  hours  overall),  relatively  realistic 
and  interactive  such  that  what  a group  does  affects  what  happens  later. 

There  is  an  interesting  anomaly  here.  Schroder  feels  that  his  PCT 
measures  interpersonal  complexity,  and  he  exhibits  the  usual  caution  about 
generalizing  to  other  domains.  Yet  the  games  or  tasks  which  the  groups 
are  given  are  not  interpersonal  in  nature,  thus  there  would  not  appear  to  be 

*1  am  indebted  to  Lawrence  Strieker  for  pointing  this  problem  out  to 
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any  basis  for  predicting  differences  between  the  performances  of  inter- 
personally  simple  and  complex  groups  » Schroder  obviates  this  objection  by 
arguing  that  the  content  of  the  game  is  not  so  important  as  the  fact  that  de- 
cisions must  be  made  in  a group,  so  interpersonal  dynamics  are  important, 
and  the  PCT  again  becomes  relevant.  One  study  has  been  run  using  single 
subjects.  In  this  case,  complex  Ss  were  still  superior  to  simple,  but  here 
the  task  had  to  do  with  learning  about  a culture  and  solving  a problem  based 
on  one's  knowledge  about  and  understanding  of  that  culture.  In  this  case, 
Schroder  argues  that  the  task  is  analogous  to  interpersonal  situations,  in 
that  similar  skills  are  required  for  success  in  both  situations.  So  far  as  I 
know,  individuals  have  never  been  run  on  the  noninterpersonal  games. 

In  summary,  the  PCT  is  a sentence  completion  measure  of  integra- 
tive complexity.  It  has  satisfactory  interrater  reliabilities,  and  no  obvious 
serious  problems  in  the  rating  process.  Test-retest  reliability  is  not 
established;  however,  extreme  groups,  matched  for  intelligence  (a  corre- 
late of  PCT  scores)  do  differ  systematically  in  predicted  ways  on  informa- 
tion-processing and  decision-making  tasks.  One  problem  with  the  test  is 
that  complex  persons  are  rare,  which  emphasizes  the  importance  of  the 
missing  test-retest  correlation,  and  which  limits  the  applicability  of  the 
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Harvey's  "This  I Believe"  Test  (Harvey,  1966) 

The  "This  I Believe"  Test  (TIB)  is  similar  in  format  to  the  PCT.  Ss 
are  presented  with  a series  of  sentences  reading,  "This  I believe  about 
. . . , " the  blank  being  filled  in  by  such  referents  as  the  American  way  of 
life,  marriage,  religion,  friendship,  etc,  (see  the  Appendix)*  Items  are 
timed,  and  Ss  are  requested  to  write  at  least  three  sentences.  Generally, 
about  10  items  are  used* 

Trained  raters  assign  scores  to  the  protocols.  However,  individual 
items  are  not  scored.  Rather,  the  entire  test  is  given  one,  overall  global, 
score.  An  attempt  is  made  to  place  each  S at  one  or  another  of  the  four 
levels.  Some  protocols  are  mixed,  • however,  and  functioning  at  more  than 
one  level  is  indicated.  In  these  cases,  two  (or  even  three)  scores  may  be 
assigned. 

Harvey  states  (1966,  p.  47)  that  the  "inter-judge  reliability  for  three 
and  four  trained  judges  in  classifying  subjects  into  one  of  the  four  main 
systems  has  been  .90  or  above  for  12  different  samples  of  subjects."  I 
take  this  to  mean  that,  for  the  Ss  for  whom  each  judge  gives  a single  as- 
signment, interrater  reliabilities  equal  or  exceed  .90,  and  that  Ss  given 
more  than  one  rating  do  not  appear  in  these  correlations.  Harvey  reports 
that,  out  of  1400  TIBT’s,  30%  were  System  1,  15%  System  2,  20%  System  3, 
7%  System  4,  and  the  rest  (28%)  were  mixed. 

No  test-retest  reliability  is  reported,  so  far  as  I have  been  able  to 
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discover . 

Scoring  criteria  can  best  be  described  as  messy.  They  are  complex, 
rather  content-oriented  (theory  to  the  contrary  notwithstanding),  and  multi- 
dimensional in  nature.  For  example,  Harvey  describes  some  of  the  more 
important  criteria  as, 

. o . the  absolutism  of  his  expressed  beliefs,  consideration  of  contin- 
gencies or  modifying  circumstances,  dependency  on  external  authori- 
ties, especially  God  and/or  religion,  frequency  of  trite  and  normative 
statements,  degree  of  ethnocentrism,  acceptance  of  socially  approved 
modes  of  behavior,  concern  with  interpersonal  relationships,  and  the 
apparent  simplicity-complexity  of  the  interpretations  of  the  world 
(Harvey,  1966,  p.  47). 

More  specifically  (Harvey,  undated),  System  1 Ss  tend  to  be  absolute 
and  closed  in  their  thoughts  and  beliefs,  to  be  highly  evaluative  and  con- 
ventional, to  show  a strong  positive  attitude  toward  traditional  authorities, 
to  be  highly  ethnocentric,  and  to  identify  closely  with  social  roles,  status 
requirements,  etc.  These  Ss  appear  to  have  internal  standards,  but  these 
derive  from  reliance  on  external  authority.  This  orientation  may  be  char- 
acterized as  pro-authoritarian. 

System  2 Ss  are  anti -authoritarian.  They  dislike  and  distrust  author- 
ity and  automatically  rebel  against  norms  and  social  pressures.  They  are 
as  rigid,  simple  and  consistent  as  System  1 Ss,  but  in  the  opposite  direc- 
tion. They  add  hostility,  rejection  and  negativity  to  the  concrete,  non- 
analytic,  categorical  thinking  of  System  1.  Harvey  also  states  that,  while 
these  Ss  reject  conventional  standards,  they  are  simultaneously  without 
other  stable  guidelines.  They  lack  personally  derived  standards,  and 
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Harvey  does  not  mention  (but  implicitly  rejects)  the  possibility  of  subgroup 
norms  „ 

System  3 S_s  appear  to  be  high  in  need  affiliation.  They  are  socially 
oriented  and  friendly.  They  tend  to  be  dependent  on  others  rather  than  on 
their  own  resources  in  the  solution  of  their  problems.  They  are  quite  ac« 
quiescent  in  a socially  accommodative  sort  of  way.  Their  opinions  tend  to 
be  flexible  and  superficial.  They  feel  that  everyone  needs  lots  of  friends. 
The  hypothesized  increase  in  complexity  over  Systems  1 and  2 is  based  on 
the  assumption  that  these  Ss  have  learned  more,  and  more  sophisticated, 
techniques  for  dealing  with  other  people  (System  3's,  after  all,  are  mani  = 
pulative  --neces sary  if  they  are  to  be  dependent),  and  are  therefore  more 
goal-oriented  and  more  flexible. 

Unless  it  is  "inner -directed,  " no  familiar  label  exists  for  System  4 
Ss.  They  are  supposed  to  be  independent,  flexible,  open,  nonevaluative, 
nonjudgmental,  and  relative.  They  like  novelty,  and  are  relatively  able  to 
tolerate  ambiguity  and/or  dissonance.  They  are  "information-oriented,  " 
which  is  to  say,  rational,  analytical,  and  empirical.  They  tend  to  present 
and  juxtapose  diverse  elements  in  their  responses,  and  to  evaluate  these 
noncategorically  and  relativistically. 

These  criteria  are,  categorically,  difficult  to  apply,  for  the  reason 
that  the  criteria  within  a system  are  not  necessarily  highly  correlated. 

This  means  that  it  is  not  uncommon  to  get  some  indicators  but  not  others, 
to  get  an  indicator  that  does  not  distinguish  between  two  systems,  or  worse, 
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to  get  indicators  from  more  than  one  system,  For  example,  if  a respond- 
ent  sees  no  value  in  (condemns)  the  American  Way  of  Life  (Level  2)  but 
simultaneously  argues  that  people  should  concentrate  on  learning  how  to 
communicate  better  with  each  other  (not  System  2,  because  an  alternative 
value  system  is  proposed),  should  the  score  be  2 or  4? 

A number  of  other  rules  exists  for  scoring,  having  to  do,  for  example, 
with  the  overall  tone  (pessimistic “optimistic)  of  the  response,  the  attitude 
toward  friends,  etc.  The  effect  of  these  rules  is  to  lead  to  difficulty,  for 
this  rater  at  least,  in  discriminating  between  I's  and  3's,  on  the  one  hand, 
and  2's  and  4's  on  the  other.  If  the  decision  is  at  all  close,  the  best  stra- 
tegy,  if  one  is  attempting  to  establish  rater  reliability,  is  to  give  both 
scores  and  thus  remove  the  S from  the  pool.  On  such  ambiguous  protocols 
it  is  not  particularly  difficult  to  obtain  moderate  negative  correlations  be- 
tween raters.  If  Harvey  is  talking  about  a dimension,  confusion  should  oc- 
cur between  adjacent  categories,  not  alternate  ones.  If  assignations  jump 
about  as  described  above,  and  in  my  experience  there  is  this  tendency,  then 
the  unidimensionality  of  Harvey's  four  systems  seems  questionable. 

A1  so,  it  should  be  noted  that  there  is  considerable  dependence  on  con- 
tent in  the  scoring.  A strong  positive  attitude  toward  religion  and/or  other 
institutions  constitutes  System  1,  negative  attitudes  System  2.  Someone 
who  feels  that  a lot  of  friends  are  important  is  a System  3,  but  if  friendship 
is  unreliable  he  is  System  2.  Examples  could  be  easily  added.  In  practice, 
only  with  System  4 criteria  do  structural  considerations  (how  something  is 
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said  rather  than  what  is  said)  begin  to  outweigh  content.  I would  suggest 
that  for  the  first  three  systems  the  TIB  primarily  measures  content  rather 
than  structure,  and,  in  fact,  measures  structure  only  insofar  as  it  is  re- 
lated to  content. 

Of  course,  Harvey  does  see  a relationship  between  content  and  struc- 
ture, and  the  relationship  is  based  on  theory.  In  the  process  he  reveals 
his  theoretical  conservatism,  because  the  derivation  of  the  relationship  can 
be  found  (perhaps  by  implication  for  some  specifics)  in  Harvey,  Hunt  and 
Schroder  (1961).  The  original  theory  specifies  a universal  sequence  (at 
least  in  this  culture)  in  which  interpersonal  experience  is  differentiated  and 
integrated,  including  in  a general  way  the  nature  of  the  developing  dimen- 
sion at  each  stage.  This,  in  turn,  implies  something  about  the  content  of 
attitudes  and  behavior,  in  the  sense  of  the  focus  of  interest  and  the  general 
style  of  approach  to  the  topic.  It  is  on  this  specification  that  Harvey  draws 
for  his  scoring  criteria.  The  circularity  of  this  process  requires  that  the 
validating  evidence  be  particularly  sound. 

Parenthetically,  it  should  be  noted  that  Harvey  does  not  report  re- 
sults as  correlations.  Like  Schroder,  he  picks  groups  of  Ss  who  operate 
at  various  levels.  Unlike  Schroder,  his  Ss  are  chosen  from  the  full  range 
of  his  variable.  Instead  of  "simple"  and  "complex"  groups,  Harvey  usually 
uses  equal  numbers  of  Ss  from  each  of  the  four  systems  (levels).  These 
groups  are  then  compared  either  using  _t_  tests  or  analyses  of  variance. 

(Of  course,  the  latter  would  be  preferred,  but  Harvey  does  not  mention 
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which  is  used.)  Typically,  nonlinear  results  are  found;  e.g.  Systems  2 and 
4 are  higher  or  lower  than  1 and  3.  The  effect  of  this  is,  first,  to  support 
Harvey's  rejection  of  correlation  in  favor  of  group  comparisons  (which 
should  be  by  analysis  of  variance);  and  second,  to  again  raise  the  question 
of  whether  the  four  systems  should  be  viewed  as  falling  along  a single  con- 
tinuum. 

Harvey  (1966)  summarizes  a great  deal  of  evidence  intended  to  vali- 
date the  TIB.  Most  of  these  reports  fall  into  one  of  two  groups.  The  first 
group  constitute  trivial  findings,  theoretically  speaking.  At  best  these 
could  be  regarded  as  validating  the  scoring  criteria,  and  it  seems  much 
more  appropriate  to  regard  these  as  hetero-method  reliabilities.  These 
relationships  should  obtain  because  they  are  built  into  the  rating  criteria. 

I include  here  the  findings  on  authoritarianism,  dogmatism,  need  for  affi- 
liation, most  of  the  religion  material,  EPPS,  Scott's  Scale  of  Values,  and 
"identification  with  the  American  motif.  " If  these  had  not  proved  agreeable, 
then  the  application  of  the  rating  criteria  would  be  under  question. 

There  is  a small  subgroup  which  does  seem  to  be  of  a construct 
validity  nature.  First,  the  WAIS  has  been  administered,  and  differences 
between  the  groups  are  found  only  on  Verbal  Intelligence  and  Vocabulary, 
Systems  4 and  2 being  significantly  higher,  although  Harvey  claims  that  the 
systems  do  not  vary  in  vocabulary  usage  on  the  TIB.  Of  course,  it  is  de- 
sirable that  the  TIB  and  PCT  be  independent  of  intelligence;  however,  this 
finding  is  at  variance  with  Schroder  on  two  counts.  First,  Schroder  re- 
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ports  that  intelligence  typically  correlates  with  integrative  complexity  at 
some  level  between  . 2 and  . 5,  a linear,  as  opposed  to  Harvey's  nonlinear, 
finding,,  Second,  Schroder  has  argued  (Schroder  et'al.,  1967)  that  of  the 
components  of  intelligence  the  one  which  should  be  least  related  to  com- 
plexity is  vocabulary,  because  it  is  based  largely  on  memory  and  no  cogni- 
tive processing  at  all  is  required  for  high  scores. 

A second  validating  relationship  is  between  cognitive  complexity  as 

I 
I 

measured  by  "a  modification"  of  Kelly’s  REP  test.  In  this  case,  a linear 
relationship  was  found  between  complexity  and  the  four  systems,  System  4 
being  most  complex.  Note  that  this  substantiates  the  position  taken  earlier 
that  Harvey's  is  now  essentially  a dimensional  theory,  and  that  it  weakens 
any  argument  he  might  make  to  the  effect  that  his  test  measures  something 
beyond  dimensionality.  It  also  serves  the  valuable  function  of  ordering  the 
four  systems  sequentially,  and  is  very  nearly  the  only  variable  to  do  so. 

The  third  construct  validity  relationship  is  with  rigidity.  This  mea- 
sure is  based  on  the  Gough  and  Sanford  Scale  on  Rigidity,  with  which  I am 
not  acquainted,  and  the  reference  for  which  is  unpublished.  Consequently, 

I can  only  report  that  this  scale  also  ordered  the  systems  sequentially, 
System  1 being  the  most  rigid. 

The  second  major  group  of  validating  studies  is  more  impressive. 

Here  anticipated  behavioral  correlates  of  complexity  were  examined.  The 
reports  are  exceedingly  brief  and  cannot  really  be  adequately  evaluated. 

Tasks  intending  to  measure  a variety  of  learning  or  concept  formation  or 
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cue  utilization  skills,  ability  to  change  set,  creativity  (novelty  plus  appro- 
priateness), admission  and  tolerance  of  deviant  inputs,  role  playing,  etc., 
were  involved.  In  general,  Systems  4 and  1 behave  appropriately,  with  2 
and  3 varying  back  and  forth  in  the  middle.  There  are  some  hints  that  the 
studies  are  not  independent,  the  same  Ss  being  used  on  various  tests,  and/ 
or  the  same  test-data  being  analyzed  in  several  different  ways.  Also,  no 
mention  is  made  of  controls  for  intelligence,  which  definitely  should  have 
been  included.  If  a fuller  report  of  the  work  met  these  objections,  how- 
ever, support  would  accrue  for  the  theoretical  position. 

To  summarize,  the  TIB  relies  on  scoring  criteria  which  are  not  fully 
satisfactory,  partly  because  they  are  too  heavily  dependent  on  content  in- 
dicators for  one  to  be  convinced  that  the  test  measures  structural  proper- 
ties, and  partly  because  the  criteria  defining  a particular  system  do  not 
seem  to  constitute  an  especially  cohesive  characterization.  The  latter 
leads  to  problems  in  assigning  Ss  to  systems,  and  causes  a fairly  high  dis- 
qualification rate  (between  one -fourth  and  one -third  of  the  Ss).  Validational 
evidence  is  spotty.  In  some  cases  positive  results  are  built  into  the  scor- 
ing criteria.  Other  predictive  validity  findings  are  more  impressive,  but 
suffer  from  the  brevity  of  description  and  from  the  implied  possibility  of 
lack  of  independence  and  control  for  intelligence.  Beyond  this,  the  TIB 
apparently  is  related  to,  but  not  correlated  with,  intelligence,  and  is  cor- 
related with  Bieri's  type  of  cognitive  complexity  (Bieri,  1966).  On  other 
variables,  more  often  than  not,  there  is  a reversal  in  the  sequence  such 
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that  Level  Z is  more  similar  to  4,  and  Level  3 to  Level  1,  rather  than 
scores  being  sequentially  ordered.  Harvey  is  superior  to  Schroder  in  that 
his  scores  are  more  evenly  spread  along  his  dimension,  and  that  he  con- 
siders the  full  dimension  in  his  re-search  rather  than  the  extremes. 

A Comparison  of  the  Two  Positions 

At  a theoretical  level,  both  of  Harvey's  first  two  systems  seem  to 
represent  Level  I functioning  for  Schroder.  For  example,  a System  1 re- 
sponse might  be  "Law  and  order  must  be  maintained  at  all  costs,  " whereas 
a System  Z S might  state,  "Cops  stink.  " Each  of  these  represents  a single 

I 

perspective,  and  so  would  be  scored  1 by  Schroder.  Harvey's  System  4 
sounds  a good  bit  like  Schroder's  score  of  3 (Level  II),  in  that  a critical 
characteristic  for  both  is  the  ability  to  see  alternative  points  of  view  (al- 
though Harvey  seems  to  emphasize  more  the  development  of  internal  stand- 
ards). It  is  interesting  to  note  that  the  incidence  of  Harvey's  System  4 
(7%)  is  roughly  equivalent  to  that  of  Schroder's  complex  Ss  (4%  to  1Z%, 
where  "complex"  is  defined  as  very  slightly  above  a score  of  3,  and  where 
a score  of  3 represents  Level  II).  Harvey's  System  3 has  no  obvious  corre- 
spondence with  Schroder's  scoring  system,  but  it  would  not  be  likely  to  be 
scored  higher  than  Z,  i.e.  it  would  still  be  simple  in  Schroder's  thinking. 

If  reasonable,  this  comparison  suggests  that  the  two  men  are  dealing 
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with  different  portions  of  the  simple -complex  continuum,  Harvey's  dimen- 
sion  being  a portion  of  Schroder's.  Harvey,  presumably,  is  then  spreading 
out  his  segment,  i.e.  making  finer  discriminations'within  it,  than  does 
Schroder o Any  such  comparison,  however,  must  be  in  spite  of  the  fact  that 
the  scoring  criteria  for  the  two  measures  are  radically  different. 

i 

A second  important  difference  between  the  theorists  seems  to  have  to 
do  with  their  respective  visions  of  the  purpose  or  value  of  their  variables, 
or  with  the  way  in  which  they  manifest  themselves  in  behavior.  I think 
Harvey  wants  to  identify  a characteristic  with  broad  implications,  one  which 
affects  a good  bit  of  the  range  of  daily  behavior,  a variable  which,  when  a 

i 

person's  standing  on  it  is  known,  will  convey  considerable  knowledge  as  to 

his  typical  style.  Schroder,  on  the  other  hand,  is  primarily  attending  to 

information  processing  and  decision  making  in  complex  situations.  The 

kind  of  complexity  with  which  he  is  concerned  is  not  engaged  in  typical  be- 
1 

havior,  and  he  goes  to  considerable  effort  to  devise  tasks  which  are  demand- 
ing enough  that  complexity  as  he  views  it  will  begin  to  play  a role. 

The  last  major  difference  we  will  consider  is  that  of  scoring  criteria, 
where  differences  are  so  striking  as  to  scarcely  need  reiteration.  In  brief, 
distinctions  among  Harvey's  first  three  systems  are  based  on  content, 
whereas  Schroder  can  legitimately  claim  to  be  oriented  toward  structure. 

This  criterion  issue  reflects  on  theory,  in  that  it  results  from  Harvey's 
reliance  on  a theory  which  essentially  specifies  content,  while  Schroder's 
restricts  itself  to  structural  considerations.  This,  in  turn,  means  that 
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Schroder's  position  is  more  easily  adapted  to  new  circumstances  and  is 
aesthetically  more  attractive  in  adaption  than  Harvey's,  since  the  latter's 
is  so  intimately  associated  with  the  interpersonal  domain  that  one  must 
either  refer  back  to  the  interpersonal  content  (in  the  domain  "History,  " one 
who  relies  on  the  writings  of  famous  historians  for  his  opinions  is  System  1) 
or  attempt  to  analogize  from  it  (in  "History,  " what  corresponds  to  need 
affiliative,  that  is  Level  3,  behavior?  )0 

Apparently,  then,  Schroder  and  Harvey  vary  in  their  definition  of 
complexity  and  the  dimension  of  complexity;  they  vary  in  their  research 
interests  and  thus  in  their  choice  of  dependent  variables;  and  they  vary  in 
both  the  definition  of  scoring  criteria  for  their  tests  and  in  the  nature  of  the 
theories  underlying  their  positions  . 


The  Present  Study 

The  purposes  of  the  present  study  are  as  follows; 

A.  To  compare  TIB  and  PCT  scores  across  a broad  range  of  S_s  . 

The  implication  of  the  discussion  above  is  that  the  two  sets  of  scores  should 
be  nearly  independent. 

B.  To  compare  both  scores  with  measures  of  intelligence.  Harvey 
reports  that  TIB  scores  are  related  to  Vocabulary  and  to  Verbal  IQ. 
Schroder  states  that  the  PCT  should  not  be  correlated  with  vocabulary,  but 
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should  be  correlated  with  components  of  intelligence  involving  complex 
problem  solving-  A test  of  the  former,  and  possibly  of  the  latter,  is  pos- 
sible  with  the  data  at  hand. 

C.  To  compare  males  and  females  on  complexity.  Neither  theory 
discusses  sex  differences,  and,  so  far  as  I know,  neither  researcher  has 
looked  for  such  differences-  Gardiner  (1968)  found  girls  to  be  more  com" 
plex  than  boys,  which  he  explained  in  terms  of  child-rearing  practices. 

D.  T o compare  samples  drawn  from  different  populations.  Most  of 
Schroder's  work  is  done  at  Princeton,  and  much  of  Harvey's  at  Colorado. 
The  present  study  will  sample  from  populations  rather  different  from  those 
typically  used  by  Schroder  and  Harvey. 


Chapter  IV 


METHOD 


Subjects 

Approximately  75  female  and  75  male  subjects  were  sought  from 
each  of  three  schools.  Johnson  C.  Smith  University  (Smith)  is  a private 
Negro  college  of  about  1200  students  drawn  largely  from  the  South.  SAT 
scores  typically  average  around  350.  Data  were  collected  from  72  women 
and  76  men.  The  University  of  North  Carolina  at  Charlotte  (UNC-C)  is  a 
branch  of  the  state  university  system,  nominally  integrated  but  preponder- 
antly  white,  currently  operated  on  a nonresidential  basis.  Due  to  unantici- 
pated difficulties  in  collecting  data,  the  UNC-C  sample  consists  of  subjects 
tested  during  the  spring  semester  and  also  during  the  summer  session, 
plus  subjects  from  Catawba  College  nearby,  thought  to  be  comparable  re- 
garding SAT  scores.  Information  was  not  available,  but  SAT  scores  were 
expected  to  be  around  500  for  both  schools.  Data  for  38  females  were  col- 
lected during  the  UNC-C  spring  semester,  for  30  females  during  the  sum- 
mer session,  and  for  4 further  females  during  Catawba's  summer  session, 
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a total  of  72.  UNC-C's  spring  semester  yielded  17  males,  the  summer  ses- 
sion 25  more,  and  Catawba's  summer  session  the  final  27  (total  69).  Stu- 
dents at  both  institutions  were  mostly  southern,  but  the  proportion  from 
northern  states  appeared  much  higher  for  Catawba  than  for  UNC-C. 

The  third  school,  Duke  University,  is  also  integrated,  but  predomi- 
nantly white.  Duke  is  private,  with  about  4000  undergraduates.  It  draws 
as  heavily  on  the  North  for  students  as  on  the  South,  and  entrance  require- 

v. 

ments,  as  largely  defined  by  SAT  scores,  are  high.  Freshman  averages 
being  generally  around  625.  Seventy-six  women  and  75  men  from  Duke 
participated. 

Subjects  at  Smith  and  UNC-C  (and  Catawba)  were  paid  $1.00  for  their 
services.  Ss  at  Duke  were  given  the  choice  of  $1.00  or  one  hour  credit  to- 
ward the  six  required  in  their  Introductory  Psychology  course.  Virtually 
everyone  chose  the  hour  credit. 

Measures  and  Procedure 

Three  tests  were  administered  in  the  experimental  session,  the 
Shipley-Hartford  Test  of  Vocabulary  and  Abstract  Thinking  (SH),  Schroder's 
Paragraph  Completion  Test  (PCT),  and  Harvey's  "This  I Believe"  Test 
(TIB).  The  PCT  and  TIB  were  combined  so  as  to  appear  to  be  a single 
task.  Additionally,  Ss  signed  a form  giving  E permission  to  obtain  SAT 
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scores  from  the  school's  records,  and  gave  such  information  as  age,  class 
in  college,  and  home  address.  SAT's  proved  to  be  unavailable  for  many 
Ss,  especially  those  in  summer  school.  Not  surprisingly,  the  highest  pro- 
portion of  SAT  scores  was  for  the  Duke  samples. 

Actual  testing  occupied  something  under  an  hour.  Number  of  Ss  per 
session  varied  between  3 and  about  35.  At  the  conclusion  of  the  testing, 
an  explanation  was  offered  those  who  were  interested,  and  in  some  cases 
these  conversations  were  prolonged.  All  tests  were  timed.  The  SH  was 
administered  first,  5 minutes  being  given  for  the  40  multiple -choice  vo- 
cabulary items,  then  10  minutes  for  the  20  abstraction  items.  Instructions 
for  the  former  are  standard  for  the  type  test  involved.  Items  for  the  test 
of  ability  to  abstract  consist  of  a series  of  numbers,  letters  or  words  ar- 
ranged according  to  an  organizing  principle.  S is  to  deduce  the  principle 
and  complete  the  last  item  in  the  sequence.  (In  the  item  "tarn  tan  rib  rid 
rat  raw  hip  , " for  example,  the  correct  response  is  hit.  ) 

The  PCT-TIB  combination  followed.  Instructions  and  items  appear 
in  the  Appendix.  The  respective  sets  of  items  appeared  together,  and  the 
sequence  in  which  they  were  offered  was  alternated.  One  item  appeared  on 
each  page,  and  each  item  was  timed.  In  the  present  case,  the  time  limit 
was  130  seconds,  which  is  toward  the  upper  end  of  the  range  of  limits  sug- 
gested by  Schroder  and  Harvey.  There  were  16  items;  10  from  the  TIB, 

6 from  the  PCT  (the  first  of  which  is  a warm-up,  nonscored  item).  The 
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PCT  items  are  fairly  well  standardized.  More  variation  exists  in  the  se- 
lection of  TIB  items. 

Scoring 

The  SH  is  scored  against  a key  and  produces  four  measures:  vocab- 

ulary (SH~V),  abstraction  (SH-A),  total  (SH-T,  the  sum  of  SH-V  and  SH-A), 
and  the  "Conceptual  Quotient"  (SH-CQ)  which  is  the  ratio  between  SH-V  and 
SH-A  (CQ  = (SH-A/SH-V)  X 100).  The  CQ  reflects  the  original  intention 
that  the  test  be  used  as  a rough  screening  device  for  detecting  the  type  of 
cognitive  impairment  likely  to  be  experienced  by  mental  patients,  based  on 
the  assumption  that  vocabulary,  a function  of  memory,  will  remain  rela- 
tively stable,  while  the  active  skill  involved  in  abstracting  will  be  an  early 
loss.  Thus,  a low  CQ  results  from  a relatively  high  SH-V  score  and  may 
indicate  a loss  in  abstract  functioning.  To  the  extent  that  CQ  may  be  re- 
lated to  analytic  thinking,  it  is  of  interest  in  the  present  case  because  of 
the  possibility,  based  on  Gardiner  and  Vannoy,  of  a relationship  between 
this  kind  of  thinking  and  complexity. 

The  PCT's  were  scored  by  Schroder’s  research  assistants  at  his 
laboratory  in  Princeton.  Schroder  reports  (Schroder  et  al . , 1967)  that  re- 
liabilities for  his  raters  range  between  .80  and  .95.  Each  item  was  award- 
ed a score  from  1 to  7 as  described  previously.  The  score  used  in  the 
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analysis  was  the  sum  of  the  two  highest  item  scores,  which  could  range  be- 
tween 2 and  14.  In  practice,  out  of  the  total  440  subjects,  no  item  received 
a score  above  4,  and  no  final  score  exceeded  7.  Some  items  are  unscor- 
able,  in  which  case  they  are  given  a zero.  If  two  or  more  zeros  occur  in 
the  protocol,  no  final  score  is  given.  This  occurred  on  approximately  6% 

i 

of  the  protocols.  Two  sets  of  PCT  scores  were  used,  the  first  derived  as 

i 

just  described.  The  second  set  may  be  termed  the  Forced  PCT  (PCT  -F) 
scores.  In  this  case,  subjects  who  had  previously  received  zero  scores 

I 

were  given  scores  based  on  whatever  number  of  items  were  available, 


these  scores  being,  again,  the  sum  of  the  two  highest  item  scores. 

Two  previously  inexperienced  raters  proceeded  with  the  TIB's  as 
follows:  Basing  procedures  and  criteria  on  mimeographed  material  sent 


from  Colorado  by  Harvey,  approximately  90  of  the  protocols  at  hand  were 
used  for  training  purposes.  The  number  was  so  large  because  it  proved 
exceedingly  difficult  to  establish  reliable  criteria.  The  protocols  involved 
were  then  rerated  at  a later  time.  After  training,  the  judges  independently 
rated  a sample  of  107  protocols.  Of  this  number,  one  or  the  other  or  both 
judges  gave  zeros  to  30  Ss,  a 28%  unscorable  rate,  comparable  to  Harvey's. 
A contingency  table  approximation  of  chi-square  for  agreeing  or  not  agree- 
ing to  score  or  not  score  protocols  produced  a value  of  15.95,  significant 
at  well  beyond  the  .001  level,  suggesting  that  there  was  agreement  between 
the  judges  as  to  which  subjects  were  and  were  not  scorable.  The  correla- 
tion between  the  judges  on  the  remaining  77  S_s  was  .83.  Since  this  was 
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regarded  as  an  adequate  indication  of  reliability,  one  judge  rated  all  the 
remaining  protocols,  and  her  ratings  were  used  in  the  analysis. 

Throughout  this  process  there  was  one  major  and  intentional  devia- 

t 

tion  from  Harvey's  recommendations.  He  advises  that  a single  global 
score  be  given  the  subject  instead  of  individual  item  scores.  Since  item 
scores  do  not  preclude  global  scores,  and  since  they  would  seem  to  permit 
greater  flexibility  in  scoring  as  well  as  tighter  control  over  biases,  item 
scores  were  assigned  throughout.  Global  scores,  then,  were  based  on  the 
frequency  of  the  item  scores,  ameliorated  if  it  seemed  appropriate,  but  as 
seldom  as  possible,  by  the  judgment  of  the  rater.  This  was  the  Global 
(TIB-G)  TIB  score. 

Three  further  scores  were  baaed  on  the  TIB.  First,  the  items  in  the 
test  are  rather  heterogeneous  in  nature.  In  spite  of  the  fact  that  Harvey 
seems  to  feel  that  he  is  measuring  interpersonal  complexity,  the  relevancy 
of  some  items  to  interpersonal  matters  is  not  always  obvious.  According- 
ly,  the  scores  on  the  5 items  which  seemed  most  immediately  interpersonal 
(marked  with  an  asterisk  in  the  Appendix)  were  considered  separately,  and 
an  Interpersonal  (I)  score  given  on  the  basis  of  the  item  scores.  As  with  G, 
of  course,  some  of  these  were  unscorable. 

The  two  remaining  scores  are  forced  versions  of  G and  I.  That  is, 
wherever  a zero  value  was  given,  the  judge  was  required  to  attempt  to 
determine  a single  rating  in  addition.  In  general,  this  proved  possible  (al- 
though distasteful),  but  more  difficult  for  the  I than  the  G scores. 
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These  scores,  then,  plus  SAT  scores  from  the  college  records  of- 
fices  for  those  students  for  whom  they  were  on  file,  and  the  information  con- 
cerning age,  college  class,  and  home  (residence)  address,  were  used  in  the 


analyses  to  follow,, 


j 

Chapter  V 
RESULTS 

The  fact  that  n's  are  typically  uneven  makes  especially  relevant  the 
question  of  homogeneity  of  variance.  Although  the  Fmax  test  for  homo- 
geneity is  intended  for  equal  n's,  Winer  (1962)  states  that  Fmax  may  be 
applied  to  unequal  n’s  if  one  uses  the  n of  the  largest  sample  in  the  analy- 

i 

sis.  In  the  case  of  nonhomogeneous  variances  plus  unequal  n's,  the  rule 
of  thumb  (Box,  1954a,  1954b)  seems  to  be  that,  if  the  larger  variances  be- 
long to  the  larger  sample,  the  probability  levels  for  the  F-ratios  will  be 
conservative,  whereas  if  the  larger  variances  are  based  on  the  smaller 
samples,  the  probabilities  will  be  overstated.  The  size  of  the  samples  in- 
volved is  not  so  important  as  the  ratios  of  their  sizes.  Thus,  approximate- 
ly equal  n's  appear  to  retain  some  robustness.  However,  when  large  dif- 
ferences in  sample  variances  exist  along  with  unequal  n's,  relationships 
become  rather  complex  and  "true"  probabilities  extremely  difficult  to  come 
by.  Unfortunately,  this  situation  exists  in  some  of  the  preliminary  analy- 
ses below.  Where  important  it  will  be  noted.  The  two-way  analyses  of 
variance  have  been  computed  using  the  least  squares  procedure  for  unequal 
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n's  outlined  by  Winer  (1952,  p.  291  ff . ) where  one  factor  has  only  two 
levels . 

Means  and  standard  deviations  for  the  variables  are  presented  in 
Table  1„  Protocols  were  94%  scorable  for  the  PCT,  85%  for  the  TIB-G, 
and  81%  for  the  TIB-I.  (The  "unscorable"  rate  for  the  TIB  is  lower  here 
than  Harvey  reports,  and  lower  than  in  the  preliminary  reliability  check 
here.  This  is  because  scores  for  only  one  rater  are  used.  "Unscorable" 
decisions,  although  significantly  related,  do  not  overlap  perfectly.  Thus, 
the  more  raters,  the  more  rejects.)  For  the  PCT,  the  percentages  of 
scores  from  2 through  7 were,  respectively,  3,  9,  39s  28,  12,  and  3.  For 
the  TIB-G,  percentages  for  scores  1 to  4 were  36,  15,  26,  and  8,  and  for 
the  TIB-1,  31,  12,  31,  and  7. 

The  various  forced  scales  correlated  perfectly  with  their  corre- 
sponding nonforced  values,  since  the  only  discrepancies  between  the  mem- 
bers of  a pair  occur  when  an  S received  a zero  score  in  the  nonforced  case, 
and  these  are  removed  from  the  correlation.  Since  four  scores  are  de- 
rived from  the  TIB,  however,  four  pairings  may  deviate  from  1.0.  These 
were  calculated  separately  for  each  of  the  six  samples  and  also  for  the 
combined  sample  (Table  2).  * The  range  for  these  28  correlations  was 
from  . 7 1 to  .87,  with  22  of  them  equalling  or  exceeding  .80.  In  short, 

^These  and  the  other  analyses  herein  reported  were  run  at  the  Duke 
University  Computer  Center,  using  programs  written  by  the  author,  and 
funds  provided  by  the  University. 
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Table  1 


Means,  Standard  Deviations  and  N's 


Age 

Shipley-Hartford 
Vocab.  Abs. 

T otal 

CQ 

Smith  Females 

X 

19.  5 

28.  2 

29.  2 

57.4 

99.0 

s 

1.  15 

3.  50 

5.  11 

7.42 

10.  6 

n 

72 

70 

70 

70 

70 

Smith  Males 

X 

19.  8 

27. 5 

29.  3 

56.  8 

100.9 

s 

1.90 

3.47 

6.00 

7.  98 

13.  1 

n 

76 

76 

76 

76 

76 

UNC-C  Females 

X 

22.  6 

31.5 

33.  6 

65.  1 

102.  2 

s 

6.  79 

3.  32 

3.  57 

4.  79 

10.7 

n 

72 

72 

72 

72 

72 

UNC-C  Males 

X 

21.4 

31.  6 

33. 3 

64.  9 

100.8 

s 

3.  84 

3.77 

4.  77 

7.  32 

10.0 

n 

69 

69 

69 

69 

69 

Duke  Females 

X 

18.7 

34.  2 

36.  3 

70.  6 

102.  1 

s 

2.46 

2.49 

2.  64 

3.  69 

7. 87 

n 

74 

76 

76 

76 

76 

Duke  Males 

X 

18.  7 

34.  3 

36.  5 

70. 8 

102.  0 

s 

1.  18 

2.  28 

2.  86 

4.  10 

6.  88 

n 

75 

75 

75 

75 

75 

Combined 

X 

20.  1 

31.3 

33.  1 

64.  3 

101. 2 

s 

3.  74 

4.  12 

5.  22 

8. 27 

10.  1 

n 

438 

438 

438 

438 

438 

N's  vary  because  of  missing  data  or  unscorable  protocols. 


Abbreviations : 
SH-V 
SH -A 
SH-T 
SH-CQ 
SAT-V 
SAT-M 


Shipley -Hartford  Vocabulary 
M " Abstraction 

" " Total  (V  + A) 

" " Conceptual  Quotient  ((A  V)  x 100) 

Scholastic  Aptitude  Test--Verbal 

" " " --Mathematical 


j 

I 

t 

I 


j 


PCT 

PCT-F 

TIB-G 

TIB-GF 

TIB -I 

TIB-1F 


Paragraph  Completion  Test--Score 

" " --Forced  Score 

"This  I Believe"  Test--Global  Score 

" --Forced  Global  Score 
" " " " --Interpersonal  Score 

" " " " --Forced  Interpersonal  Score 


Table  1 (continued) 


y 

SAT 

M 

PCT 

PCT -F  PCT 

G-F 

TIB 

G 

I-F 

I 

386 

372 

4.  15 

4.  16 

1.94 

1. 98 

2.  22 

2.  29 

83 

75 

. 85 

. 86 

.98 

.99 

.99 

.99 

54 

54 

72 

68 

71 

55 

60 

56 

374 

400 

4.  34 

4.  37 

1.86 

1.73 

1.75 

1.75 

86 

83 

o 95 

.96 

. 88 

. 85 

. 87 

.89 

48 

48 

76 

68 

76 

59 

64 

55 

474 

477 

4.  50 

4.  52 

1.86 

1. 78 

1. 96 

1. 98 

71 

81 

.89 

. 88 

1.05 

1.03 

1. 06 

1.06 

50 

50 

72 

67 

72 

64 

71 

62 

468 

509 

4.41 

4.  36 

1.88 

1. 97 

2.  08 

2.  09 

97 

80 

1.  19 

1.  20 

1.01 

1.02 

1.03 

1.06 

49 

49 

69 

66 

69 

59 

64 

53 

614 

621 

4.  58 

4.  63 

2.40 

2.40 

2.  50 

2.  52 

88 

74 

1.06 

1.05 

1.  10 

1.11 

1.09 

1.11 

7 5 

75 

76 

71 

76 

70 

7 1 

66 

615 

663 

4.  88 

4.  88 

2.  35 

2.  39 

2.  39 

2.  39 

74 

73 

1.  16 

1.  15 

1.01 

1. 04 

. 92 

.93 

71 

71 

75 

72 

75 

66 

70 

65 

505 

524 

4.48 

4.49 

2.  05 

2.  06 

2.  15 

2.  18 

130 

134 

1. 04 

1. 04 

1.03 

1.04 

1.03 

1. 04 

347 

347 

440 

412 

439 

373 

400 

357 
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they  are  highly  comparable.  Parallel  analyses  were  run  on  all  six  scores 
in  nearly  all  cases,  but  with  rare  exceptions,  parallel  results  were  found 
for  scores  deriving  from  the  same  test.  Accordingly,  unless  the  others 
are  of  special  interest,  only  the  results  for  the  PCT  and  TIB-G  scores 
(i„e.  the  scores  as  used  by  their  respective  creators)  will  be  considered. 
All  t tests  and  correlations  are  two-tailed. 


Preliminary  Tests 

The  initial  question  asked  of  the  data  was  whether  the  sequence  in 
which  the  Ss  took  the  PCT  and  TIB  affected  their  scores,  Ss  were  divided 
according  to  which  test  they  had  taken  first,  and  the  two  groups  compared 
on  each  of  the  complexity  scores  by  Jt^  tests.  These  were  uniformly  non- 
significant, The  sequence  in  which  the  PCT  and  TIB  were  taken  did  not 
affect  the  scores  (see  Table  3), 

The  second  issue  was  whether  the  UNC-C  sample,  drawn  as  it  was 
from  several  sources,  was  homogeneous  enough  to  be  regarded  as  a single 
sample.  Since  the  males  were  relatively  evenly  distributed  among  the 
three  sources  (n's  were  17,  25,  27),  one-way  analyses  of  variance  were 
calculated  on  the  complexity  scores  and  on  each  of  the  six  intelligence 
scores  (Table  4).  These  tests  were  all  nonsignificant,  so  there  is  no  ob- 
stacle to  regarding  the  UNC-C  males  as  representing  a single  population. 
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Fmax  significantly  large  for  the  three  groups  in  the  analysis  of  SAT -V 
scores,  and  the  larger  of  the  variances  is  based  on  the  smaller  n.  Even 
though  this  combination  is  likely  to  lead  to  an  unrealistically  improbable 
result  (Box,  1954a),  the  ANOVA  F is  nonsignificant. 

Since  only  four  female  Ss  were  from  Catawba,  these  were  combined 
with  the  UNC-C  female  summer  session  Ss,  and  t tests  were  used  to  com- 
pare this  group  with  the  UNC-C  female  spring  semester  Ss.  In  this  case 
the  difference  between  the  two  groups  on  their  PCT  scores  was  significant 
at  a probability  level  between  .01  and  .02,  the  summer  session  women 
having  the  higher  average  (Table  4).  The  difference  between  the  two  groups 
on  their  TIB-G  scores  has  an  associated  probability  between  .06  and  .07, 
which  is  generally  considered  nonsignificant,  but  is  of  interest  here  for  two 
reasons:  First,  the  two  groups  had  different  variances  on  TIB-G,  and 

since  the  larger  variance  was  based  on  the  larger  n,  the  probability  of  the 
difference  may  be  slightly  conservative  (the  n's  are  not  greatly  different). 
Second,  the  direction  of  the  difference  is  opposite  that  on  the  PCT,  i.e.  the 
spring  semester  students  have  the  higher  average. 

The  same  two  groups  were  compared  on  the  intelligence  measures. 
The  two  groups  differed  significantly  (.01  < p < .02)  only  on  the  SH-V  mea- 
sure, the  difference  favoring  the  summer  school  students. 

Thus,  while  the  UNC-C  male  sample  may  be  treated  as  a single  unit, 
there  is  reason  to  doubt  the  homogeneity  of  the  female  group.  Analyses  in 
which  this  might  prove  important  were  generally  run  three  times,  once 
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with  the  combined  UNC-C  female  sample,  and  once  with  each  of  the  sub 
groups „ 


Complexity,  Age,  College  Class,  Sex  and  Residence 

Since  complexity  is  often  discussed  in  developmental  terms,  and 
since  roughly  20%  of  the  UNC-C  sample  (combining  males  and  females) 
were  older  than  normal  for  undergraduates,  a series  of  t tests  were  run  on 
complexity  scores  to  check  for  age  effects.  In  this  situation,  23  or  over 
was  nold"  and  under  23  "young,  11  These  tests  were  all  nonsignificant 
(Table  3,  p.  63). 

As  a second  rough  check  for  age  effects  (rough  since  clearly  in  this 
case,  as  well  as  the  one  above,  selection  factors  other  than  age  are  also 
operating)  Sex  x Cias  s -in~College  two-way  analyses  were  run  separately 
for  the  Smith  and  UNC-C  samples,  class  being  considered  lower  (Fresh- 
man and  Sophomore)  vs.  upper  (Junior  and  Senior)  „ No  such  analysis  was 
performed  on  the  Duke  sample  because  there  were  too  few  upperclassmen. 

For  the  UNC-C  sample,  a class  effect  existed  such  that  upperclass- 
men were  more  complex  than  lower  on  four  of  the  six  complexity  mea- 
sures, including  the  PCT  and  TIB-G  (Table  3)„  Fmax  is  nearly  significant 
at  the  .05  level  for  the  PCT  analysis,  but  again  large  variances  are  based 
on  large  n's,  so  the  results  are  probably  conservative.  In  this  case,  older 
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students,  as  defined  by  college  experience,  are  more  complex  than  younger, 
a finding  perhaps  supportive  of  a developmental  positiom  (Apropos  the 
previous  finding  of  nonsimilarity  between  the  two  female  groups,  the  spring 
Ss  had  9 upperclassmen,  the  summer  group  13.  The  difference,  even  ex- 
pressed as  proportions,  is  not  large,,  Such  as  it  is,  the  difference  is  con- 
sistent  with  the  pair  of  results  for  the  PCT  and  contradictory  for  the  TIB-G 
findings  . ) 

The  college  class  difference  was  not  replicated  in  the  Smith  sample, 
however  (Table  3,  p.  63)  . In  this  case,  there  were  no  class  effects  what- 
soever,, This  analysis,  however,  constitutes  the  sole  instance  in  which  the 
TIB-G  and  TIB-I  provided  results  different  from  each  other  in  an  interest- 
ing way.  The  sex  effects  are  highly  significant  for  the  TIB-I  and  not  at  all 
so  for  the  TIB-G,  this  despite  their  correlations  of  .83  and  .84  for  the  two 
samples  from  Smith.  The  difference  between  the  sexes,  females  having 
the  higher  average,  seems  to  result  from  the  fact  that  a fairly  large  pro- 
portion of  the  Smith  males  express  severe  doubts  about  the  permanence  and 
reliability  of  friendship,  such  distrust  and  cynicism  leading  to  scores  of  1 
or  2 in  Harvey's  system.  This  finding  did  not  appear  in  the  UNC-C  Class  x 
Sex  analysis  (nor  in  the  Duke  ANOVA,  below),  so  it  represents  either  a 
minority  group  phenomenon  (not  an  implausible  hypothesis)  or  sampling  er- 
ror. 

Duke's  selection  practices  permit  a test  of  whether  residence  in  the 
South  or  North  affects  complexity  scores.  The  question  is  relevant  because 
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Southern  education  is  often  regarded  as  inferior.  Accordingly,  a Residence 
x Sex  ANOVA  was  run.  Neither  sex  nor  residence  produced  significant  re- 
sults (Table  3,  p.  63),  Apparently,  either  residence  is  unimportant,  or 
becomes  so  when  intelligence  is  high.  (No  check  was  made  on  the  compar- 
ability of  intelligence  scores  for  the  two  residence  groups.  For  the  Duke 
women,  several  of  the  intelligence  scores  did  correlate  significantly  with 
the  TIB -G, ) 


Complexity,  Schools  and  Intelligence 

College  x Sex  ANOVA1  s were  run  for  both  intelligence  and  complexity 
scores.  Since  the  heterogeneity  of  the  UNC-C  female  (UF)  sample  might 
be  important  here,  each  analysis  was  done  three  times,  once  each  with  the 
combined  UF  sample,  the  UF-spring  subsample,  and  the  UF-summer  sub- 
sample. For  the  combined  UF  sample,  n's  in  the  analyses  tend  to  be  ap- 
proximately equal.  The  subsamples  make  the  n's  fairly  unequal.  There  is 
no  serious  problem  for  the  complexity  ANOVA's  because  variances  are 
homogeneous.  For  the  UF-spring  subsample,  Fmax  is  borderline  signifi- 
cant for  the  PCT,  but  the  large  variance  and  large  n's  go  together,  mitigat- 
ing the  problem. 

The  situation  is  otherwise  for  the  intelligence  score  comparisons. 

For  all  three  UF  samples  all  variances  based  on  the  Shipley-Hartford  are 


69 


significantly  (or  nearly  so)  different,  and  in  at  least  one  of  the  cases  SAT -V 
is  also  nonhomogeneous  „ For  two  of  the  variations  either  sample  sizes  are 
roughly  equal  (combined  UF),  or  there  is  a favorable  combination  of  sample 
size  and  variance  magnitude  (UF-'spring)  , Not  even  that  much  can  be  said 
for  the  UF»summer  sa.mple„  However,  the  pattern  of  ANOVA  results  is 
identical  in  all  three  cases,  and  is  in  accord  with  expectations  and  selec- 
tion  criteria:  There  is  an  enormous  intelligence  effect  across  schools, 

F's  range  from  approximately  100  to  approximately  350,  The  only  intelli- 
gence-based measure  not  related  to  schools  is  the  SH  Conceptual  Quotient 
(Abstraction  divided  by  Vocabulary  and  multiplied  by  100),  Additionally, 
there  is  a sex  main  effect  on  SAT -M,  males,  as  would  be  expected,  being 
higher  (Table  5), 

The  results  for  complexity  are  similar  but  not  so  overwhelming 
(Table  6),  All  complexity  scores  for  all  three  analyses  show  significant 
school  effects.  For  the  combined  UF  sample,  only  the  school  effects  are 
significant.  For  the  subsamples,  additional  significant  results  occur  in  a 
pattern  which  follows  logically  from  the  differences  in  their  scores:  When 

the  UF-spring  group  was  used  (who  were  lower  on  the  PCT  and  higher  on 
the  TIB  than  were  the  UF-summer  Ss),  significant  sex  effects  appear,  with 
males  averaging  higher  on  the  PCT  and  females  higher  on  the  TIB,  With 
the  UF-summer  group,  sex  effects  disappear,  and  are  replaced  with  inter- 
actions: On  the  PCT,  Smith  males,  UNC-C  females  and  Duke  males  have 

higher  means  than  the  opposite  sexes  at  their  respective  schools.  The 
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Table  6 (continued) 
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direction  of  the  interaction  is  reversed  on  the  TIB,  and  does  not  quite  reach 
significance  for  the  TIB-G  scores,  For  TIB-I,  however.  Smith  females, 
UNC-C  males  and  Duke  females  have  the  higher  means,  and  the  interaction 
is  significant. 

All  three  complexity  scores  (PCT,  TIB-G,  TIB-I)  provided  signifi- 
cant school  effects,  and  in  each  case  the  mean  for  Smith  was  lowest  and 
that  for  Duke  highest.  For  the  two  TIB  scores  the  differences  between 
Smith  and  UNC-C  were  extremely  small;  but  on  the  PCT,  UNC-C  is  more 
nearly  in  the  middle.  In  view  of  the  difference  between  Smith  and  UNC-C 
on  SAT  scores  (approximately  100  points)  and  race,  with  all  the  implica- 

! 

tions  associated  with  the  latter,  it  is  of  particular  interest  to  know  which 

: 

groups  differed  from  each  other.  Accordingly,  for  each  complexity  score 

| 

individual  comparisons  (Winer,  1962)  were  made  for  all  pairs  of  schools  in 

■ 

the  ANOVA  using  the  combined  UNC-C  female  sample.  The  pattern  of  re- 
suits  is  precisely  the  same  for  all  three  scores:  Smith  and  UNC-C  are  not 

i 

different  from  each  other,  but  both  are  different  from  Duke.  This  pattern 
holds  whether  one  selects  critical  values  according  to  "risky"  or  "con- 
servative" principles. 

PCT,  TIB  and  Intelligence 

■ 

Results  regarding  the  relationships  between  the  PCT  and  TIB  (see 


. 


Table  2,  p0  61)  may  be  succinctly  stated:  The  two  tests  yield  orthogonal 
scores.  No  significant  correlations  were  found  for  either  the  PCT  and 
TIB-G  or  the  PCT  and  TIB-I,  for  any  of  the  six  samples  or  for  the  entire 
combined  sample.  The  correlations  ranged  between  07  and  +.21. 

Nor  was  there  support  for  the  earlier  suggestion  that  Harvey's  Sys- 
terns  I and  II,  and  probably  III,  were  discriminations  of  Schroder’s  simple 
scores,  whereas  Harvey's  IV's  were  like  Schroder's  6's  (and  above).  A 
chi-square  testing  this  hypothesis  (TIB  1-3  vs.  4,  PCT  less  than  6 vs . 6 
and  7)  was  nonsignificant. 

Both  the  PCT  and  TIB-G  correlate  more  strongly  with  intelligence 
than  they  do  with  each  other  (Tables  7 and  8).  Further,  the  pattern  of  cor- 
relations with  the  intelligence  measures  is  basically  the  same  for  both 
tests.  In  my  opinion,  there  is  no  great  cause  for  concern,  however.  The 
proportion  of  variance  involved  in  these  correlations  is  never  very  large 
and  is  typically  quite  small  (though  statistically  significant). 

The  most  important  set  of  these  correlations,  of  course,  is  for  the 
combined  samples.  This  provides  a particularly  broad  range  of  intelligence 
scores,  and  an  N which  typically  ranges  between  300  and  400.  In  this  case, 
we  find  that  both  the  PCT  and  the  TIB-G  are  significantly  (p  < .01)  corre- 
lated with  SH-Vocabulary,  SH -Abstraction,  SH-Total,  SAT -Verbal  and 
SAT -Mathematics.  (It  should  be  noted  that  SH-T  is  positively  correlated 
with  SH-V  and  SH-A.)  In  magnitude  the  correlations  range  from  . 14  to  . 27, 
so  at  the  most  about  8%  of  the  variance  is  involved.  A few  of  the  correla- 
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Table  7 


Correlations  Between  Intelligence  Measures  and  the 
Paragraph  Completion  Test  (PCT) 


Vocab. 

Shipley- 
Abs . 

■Hartford 
T otal 

CQ 

SAT 

Verbal 

Math. 

Smith  Females 

r 

. 23 

. 19 

. 24 

. 02 

. 02 

. 08 

n 

66 

66 

66 

66 

50 

50 

Smith  Males 

r 

. 19 

. 09 

. 15 

-.  04 

. 07 

. 07 

n 

68 

68 

68 

68 

46 

46 

UNC-C  Females 

r 

. 30* 

.01 

. 22 

18 

. 10 

. 03 

n 

67 

67 

67 

67 

47 

47 

UNC-C  Males 

r 

. 18 

. 08 

. 14 

05 

. 06 

-.01 

n 

66 

66 

66 

66 

47 

47 

Duke  Females 

r 

-.01 

05 

-.04 

-.  04 

. 04 

. 14 

n 

71 

71 

71 

71 

70 

70 

Duke  Males 

r 

. 18 

. 05 

. 13 

-.07 

. 12 

. 16 

n 

72 

72 

72 

72 

68 

68 

Combined 

r 

. 26*** 

. 16** 

. 23*** 

04 

. 18** 

. 20*** 

Samples 

n 

410 

410 

410 

410 

328 

328 

*p  - . 05 

**p  =.01 
***p  = . 001 


Abbreviations: 
SH- V 
SH  -A 
SH-T 
SH-CQ 
SAT-V 
SAT  -M 


Shipley-Hartford  Vocabulary 
" " Abstraction 

•«  " Total  (V  + A) 

" " Conceptual  Quotient  ((A  *—  V)  x 100) 

Scholastic  Aptitude  Test--Verbal 

" " ” --Mathematical 
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Table  8 


Correlations  Between  Intelligence  Measures  and  the 
"This  I Believe”  Test  (TIB) 


Vocab. 

Shipl-ey-Hartford 
Abs.  Total 

CQ 

SAT 

Verbal  Math 

Smith  Females 

r 

. 07 

03 

.01 

08 

o 17 

. 13 

n 

53 

53 

53 

53 

41 

41 

Smith  Males 

r 

06 

. 33* 

. 22 

.40** 

. 10 

. 26 

n 

59 

59 

59 

59 

38 

38 

UNC“C  Females 

r 

. 26* 

. 08 

. 23 

15 

. 17 

. 26 

n 

64 

64 

64 

64 

44 

44 

UNC-C  Males 

r 

. 19 

“.  02 

.09 

15 

. 10 

. 08 

n 

59 

59 

59 

59 

41 

41 

Duke  Females 

r 

. 23 

24* 

».  02 

“. 35** 

. 24* 

. 02 

n 

70 

70 

70 

70 

69 

69 

Duke  Males 

r 

. 11 

.01 

. 07 

“.07 

. 23 

. 03 

n 

66 

66 

66 

66 

62 

62 

Combined 

r 

. 23**'! 

= . 14** 

. 20*** 

05 

. 27*** 

. 23*** 

Samples 

n 

371 

371 

371 

371 

295 

295 

*p  ~ . 05 


**p  “.01 
***p  “.001 


Abbreviations: 

SH-V 

SH-A 

SH“T 

SH“CQ 

SAT-V 

SAT-M 


Shipley-Hartford  Vocabulary 
” . " Abstraction 

” " Total  (V  + A) 

11  " Conceptual  Quotient  ((A  V)  x 100) 

Scholastic  Aptitude  Test“-Verbal 

" " 11  -“Mathematical 
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tions  for  the  individual  samples  are  larger,  but  even  the  largest  is  only  .40. 

Within  the  samples  no  impelling  pattern  seems  to  exists  In  two  cases, 
Smith  males  and  Duke  females,  SH -Abstraction  and  SH-Conceptual  Quotient 
correlated  significantly  with  TIB-G.  (Again,  SH-A  and  SH-CQ  are  not  in- 
dependent.) However,  for  the  latter  group  the  two  correlations  are  nega- 
tive, while  in  the  former  they  are  positive.  This  ambiguity,  plus  the  weak 
relationship  between  SH-A  and  the  complexity  scores,  makes  unlikely  my 
earlier  suggestion  that  complexity  is  based  on  analytic  thinking,  at  least 
insofar  as  SH-A  or  SH-CQ  measure  this  skill. 

This  weak  relationship  is  consistent  with  Schroder's  prediction  re- 
garding his  PCT  and  the  Abstraction  score.  His  expectation  was  based  on 
two  considerations:  The  SH-A  tasks  are  not  interpersonal  and  they  are  not 

complex  enough  to  fully  engage  complex  thinking.  On  the  other  hand,  his 
prediction  of  nonrelationship  between  the  PCT  and  vocabulary  and/or.  verbal 
scores  is  contradicted  by  the  present  findings.  That  is,  Schroder  argues 
that  if  anything  the  PCT  will  be  related  to  complex  information-processing 
or  decision-making  aspects  of  intelligence,  but  not  to  verbal  skills  per  se, 
and  particularly  not  to  memory  skills  (e.  g.  vocabulary).  In  this  study, 
both  the  PCT  and  the  TIB  are  related  to  vocabulary  and  to  verbal  skills. 

This  finding  is  consistent  with  Harvey's  report  (1966). 

Finally,  since  Harvey  typically  analyzes  his  data  using  group  means 
instead  of  correlations,  in  each  of  the  six  samples  Ss  were  grouped  by  each 
complexity  score  (separately)  and  one-way  ANOVA's  run  for  each  of  the 
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intelligence  scores  and  for  age  (four  groups  were  used  for  the  PCT  also: 
scores  of  2-3,  4,  5,  6-7).  No  surprising  results  were  discovered,,  In  a 
few  cases  (four),  significant  correlations  between  complexity  and  intelli- 
gence had  been  found,  but  the  present  F ratios  were  nonsignificant,.  This  is 
presumably  due  to  greater  power  on  the  part  of  the  correlation  in  testing  a 
linear  hypothesis,  plus  its  lack  of  concern  for  unequal  n's,  etc.  In  three 
instances  the  correlations  and  F's  were  both  significant  There  were  six 
significant  F/nonsignificant  r combinations  „ Three  of  these  were  for 
groups  as  defined  by  TIB-I  scores,  for  which  correlations  had  not  been  cal- 
culated,, No  particular  pattern  appears  for  these  results,  except  that  on 
the  TIB  there  is  some  tendency  for  Levels  II  and  IV  to  be  alike  and  I and  III 
to  be  alike,,  This  latter  pattern,  but  not  the  linear  findings,  are  consistent 
with  Harvey  (1966)„ 


Reliability 

Of  course,  the  best  way  to  learn  about  the  test-retest  reliability  of 
these  tests  would  be  to  devise  alternate  forms  and  administer  both  versions 
to  some  Ssc  Lacking  this,  an  estimate  of  the  reliability,  coefficient  alpha 
(R  ozeboom,  1966),  can  be  calculated  on  the  basis  of  the  variances  of  the 
items  and  of  the  total  scores.  The  scoring  procedures  used  above,  how- 
ever, do  not  lead  to  values  which  can  be  used  to  calculate  alpha.  Accord- 


- 
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ingly,  the  PCT  and  TIB  item  scores  were  used  to  derive  total  scores  com- 
patible with  the  application  of  alpha.,  In  each  case*  the  average  of  the 
scored  items  was  found,  and  this  value  multiplied  by  the  total  number  of 
items  (5  and  10,  respectively).  The  variance  of  these  scores  was  used  in 
finding  the  alpha  coefficients. 

In  general,  these  scores  (SumPCT  and  SumTIB)  behaved  as  the  other 
scores.  Summed  scores  generally  correlated  in  the  ,80's  with  the  corre- 
sponding regular  scores  (SumTIB  with  TIB-I  being  mostly  in  the  ,70's). 
Also,  the  summed  scores  tended  not  to  correlate  significantly  with  scores 
based  on  the  other  test.  Here,  however,  there  were  exceptions.  For  the 
Smith  females,  SumPCT  correlated  . 235  with  SumTIB  and  . 241  with  TIB-G 
both  significant  at  the  „ 05  level;  and  in  the  Duke  male  sample,  SumPCT 
barely  misses  significance  with  TIB-I  (r  = .231),  With  the  combined 
samples,  none  of  the  intertest  correlations  approach  statistical  signifi- 
cance. 

For  the  combined  samples,  the  estimates  of  test-retest  reliability 
are  .406  for  the  PCT  and  .776  for  the  TIB.  Within  the  samples,  the  val- 
ues for  the  PCT  range  between  . 294  and  . 515,  and  between  . 605  and  . 814 
for  the  TIB  (Table  9).  On  these  grounds  the  TIB  can  be  regarded  as  reli- 


able, and  the  PCT  cannot. 
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Chapter  VI 


DISCUSSION 
Summary  of  Results 

The  major  purposes  of  this  study  were  outlined  at  the  close  of  Chap- 
ter III.  The  relevant  results  are  as  follows: 

A.  The  main  point  of  interest  seems  to  have  been  unequivocally  re- 
solved: Schroder’s  PCT  and  Harvey's  TIB  are  uncorrelated  with  each 

other,  and  are  therefore  not  measuring  the  same  variable.  Even  when  the 
TIB  scoring  is  revised  to  make  it  more  obviously  interpersonal  in  orienta- 
tion (and  it  should  be  emphasized  that  even  the  global  score  is  assumed  to 
be  interpersonal),  no  relationship  with  the  PCT  exists. 

The  current  data  cannot  tell  us  whether  this  independence  is  based  on 
the  failure  of  one  test  to  measure  what  it  is  supposed  to,  or  whether  one  of 
the  measures  is  "better"  than  the  other.  As  has  been  shown,  the  scoring 
criteria  are  vastly  different  for  the  two,  but  each  has  a basis  in  the  re- 
spective theory.  The  suggestion  that  the  two  theories  might  be  dealing  with 
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different  portions  of  the  dimension  was  not  substantiated. 

The  most  obvious  consequence  of  this  is  that  the  theories  and  asso- 
ciated research  are  irrelevant  to  each  other.  Irrespective  of  similarities 
in  background,  language,  terminology,  etc,,  it  is  simply  not  appropriate  to 
combine  the  findings  in  attempting  to  define  or  understand  "integrative  com- 
plexity, " 

B,  A second  major  question  is  the  relationship  of  complexity  with  in- 
telligence. The  pattern  is  the  same  for  both  the  PCT  and  the  TIB:  com- 

plexity is  significantly  related  to  measures  of  simple  vocabulary  and  ab- 
straction ability,  and  to  verbal  and  mathematical  skills  as  measured  by  the 
SAT.  These  findings  essentially  replicate  those  of  Harvey  (1966),  but  are 
much  weaker  than  the  Schroder  et  al.  (1967)  report  of  correlations  in  the 
.40's  when  the  variance  of  intelligence  scores  is  great.  The  results  con- 
tradict Schroder's  prediction  that  complexity  should  not  correlate  with  ver- 
bal skills,  and  especially  should  not  correlate  with  memory  functions  (vo- 
cabulary) , 

The  correlations  with  intelligence,  while  statistically  significant, 
were  not  large,  and  were  certainly  not  large  enough  to  threaten  the  value 
or  standing  of  either  test  on  this  basis  alone,  provided  it  is  understood  that 
either  measure  of  complexity  has  more  in  common  with  intelligence  than 
with  the  other  complexity  measure.  The  relationship  with  intelligence  is 
strong  enough  that  research  using  either  measure  is  of  questionable  value 
unless  controls  for  intelligence  are  incorporated.  Schroder  routinely 
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matches  Ss  for  intelligence  (e.g.  Schroder  et  al.f  1967);  Harvey  (e.g. 
Harvey,  1966)  often  does  not  mention  it,  and  therefore  presumably  does  not 
do  it. 

C.  As  to  the  existence  of  sex  differences  in  the  complexity  scores, 
results  are  not  clear-cut.  Several  alternative  answers  are  available  de- 
pending on  which  sample  and  which  analysis  one  wishes  to  rely  upon.  The 
broadest  answer,  based  on  the  Sex  x Schools  ANOVA  with  the  complete 
UNC-C  female  sample,  is  that  there  are  no  sex  effects.  However,  the 
same  analysis  performed  separately  with  the  two  distinguishable  UNC-C 
female  subsamples  did  lead  either  to  sex  or  interaction  effects. 

At  least  one  ANOVA  was  done  independently  for  each  school  in  which 
sex  differences  could  have  been  demonstrated.  Only  Smith  showed  a sex 
effect,  and  that  was  for  the  TIB-I  score;  that  is,  a strongly  interpersonal 
measure  based  on  the  TIB.  In  this  case,  females  were  more  complex,  a 
result  which  makes  some  sense  in  terms  of  the  kinds  of  stresses  to  which 
minority  groups  are  likely  to  be  exposed  (e.g.  the  greater  difficulty  the 
male  has  in  playing  his  role).  Since  students  from  Smith  are  probably  also 
the  lowest  of  the  three  groups  socioeconomically,  this  finding  partially  re- 
plicates Gardiner's  (1968),  that  among  children  from  lower  class  back- 
grounds females  were  more  complex  than  males.  The  question  of  sex  dif- 
ferences, then,  is  still  open.  Whether  a difference  will  exist  apparently 
depends  upon  population  characteristics.  A check  should  probably  be  made 
in  every  situation  in  which  a difference  might  be  important. 


D.  Differences  in  complexity  scores  among  schools  could  result 
from  any  of  many  factors,  the  most  obvious  being  intelligence,  race  and 
socioeconomic  background,  these  being  all  interrelated.  The  mean  com- 
plexity scores  for  the  three  schools  did  fall  in  the  expected  sequence,  but 
individual  comparisons  show  that  the  differences  between  Smith  and  UNC-C 
are  nonsignificant,  and  that  both  differ  significantly  from  Duke.  This  means 
that  neither  race  per  se,  nor  fairly  large  intelligence  differences  (approxi- 
mately 1 sigma  on  the  SAT),  are  sufficient  to  indicate  complexity  differ- 
ences. Of  course,  this  could  be  attributed  to  a floor  in  the  complexity 
scores,  but  that  problem  is  intrinsic  in  the  tests  and  theories. 

E.  Test  reliability,  as  estimated  by  coefficient  alpha,  is  unaccept- 
ably low  for  the  PCT  (.41),  but  reasonably  high  for  the  TIB  (.78).  The 
inter -item  correlations  for  the  PCT  were  lower  than  those  for  the  TIB  (the 
former  ranged  between  . 05  and  . 17,  and  the  latter  between  . 15  and  .47, 
and  were  mostly  in  the  . 20' s),  and  were  much  lower  than  those  reported  in 
Schroder  et  al.  (1967)  (range  . 39  to  . 62). 

It  should  be  kept  in  mind  that  the  scores  used  to  calculate  coefficient 
alpha  are  not  those  regularly  used  in  research.  In  the  case  of  the  PCT,  the 
difference  is  probably  not  important:  Although  the  score  currently  in  favor 

is  the  sum  of  the  two  highest  item  scores,  the  mean  of  all  the  item  scores 
has  been  used  and  is  presumably  still  acceptable.  Using  the  sum  of  the 
item  scores  (but  adjusting  for  nonscored  items)  instead  of  the  mean  is 
necessary  for  calculating  alpha,  but  does  not  change  score  relationships. 
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The  set  of  scores  derived  from  the  TIB  and  used  in  finding  alpha  bears 
no  such  close  relationship  to  regular  TIB  scores,  for  the  simple  reason  that 
Harvey  does  not  assign  individual  item  scores.  This  practice  seems  to  me 
to  be  a weakness  in  that  it  requires  the  scorer  to  somehow  bear  in  mind  10 
(the  number  of  items)  sources  of  information,  which  is  essentially  not  pos- 
sible <,  In  my  opinion,  the  kernel  of  this  objection  holds  even  though  the 
rater  keeps  some  sort  of  mental  cumulative  tally:  Why  not  write  it  down? 

In  the  present  case  we  did,  separately  for  each  item,  This  enabled  us  to 
obtain  the  Interpersonal  score,  and  later  to  sum  the  item  scores  so  as  to 
estimate  reliability.  In  both  cases,  the  summed  scores  correlated  highly 
with  the  regular  scores. 

The  low  reliability  of  the  PCT  is  something  of  a threat  to  it,  espe- 
cially in  view  of  the  fact  that  high  scores  are  so  rare:  The  misclassifica- 

tions  which  result  from  low  reliability  are  a serious  problem.  That 
Schroder’s  research  results  conform  so  nicely  to  theory  suggests,  in  light 
of  this  low  reliability,  that  his  cutoff  score  may  be  higher  than  would  be 
necessary  for  a more  reliable  measure.  Increasing  the  number  of  items 
(and  reverting  to  mean  scores)  would  probably  help  to  stabilize  the  mea- 
sure. Of  course,  it  would  be  valuable  to  have  an  actual  test-retest  coeffi- 
cient instead  of  an  estimate,  and  responsibility  for  this  is  basically 
Schroder’s,  since  he  is  the  person  best  equipped  to  devise  alternative  items 
and  provide  authoritative  ratings.  Until  such  an  effort  is  mounted,  the  re- 
liability of  the  instrument  may  be  regarded  as  unsatisfactory. 


. 


The  situation  of  the  TIB  is  different.  Although  coefficient  alpha  is 
acceptable,  the  onus  for  a true  test-retest  reliability  correlation  for  the 
TIB  rests  on  Harvey,  just  as  it  does  on  Schroder  for  the  PCT,  in  spite  of 
the  present  satisfactory  value  of  alpha.  This  is  so  because,  beyond  the  re- 
sponsibility of  the  test  creator  to  provide  adequate  information  about  his 
test,  the  set  of  TIB  scores  used  to  calculate  alpha  bear  a less  certain  re- 
lationship to  the  usual  TIB  scores  than  was  the  case  for  the  PCT.  This  in 
turn  is  partly  due  to  the  fact  that  confusion  in  the  TIB  scoring  is  typically 
between  a score  of  2 or  4 on  the  one  hand,  and  1 or  3 on  the  other.  Thus, 
it  is  not  uncommon  that  the  switch  of  the  score  on  a single  item  (as  on  a 
retest)  would  lead  to  a change  of  two  points  in  the  global  score,  while  it 
would  be  scarcely  noticeable  on  the  sum-of-items  score.  In  short,  al- 
though the  correlation  between  TIB-G  and  the  summed-items  scores  is 
high,  the  categorization  of  Ss  into  four  levels  based  on  a retest  might  not 
be  so  much  in  agreement.  I would  not  be  surprised  if  alpha  overestimates 
the  reliability  for  the  TIB,  but  the  issue  could  be  resolved  empirically  more 
quickly  and  easily  than  by  debate. 


Comment  and  Implications 

Given  a choice,  I personally  prefer  Schroder's  position  to  Harvey's, 
for  two  reasons.  First,  the  theory  is  more  elegant,  in  that  it  is  simpler, 
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more  easily  applied  to  a broad  range  of  cognitive  domains,  and  permits  be- 
havioral predictions  which  are  more  clearly  related  to  the  theory,,  By  these 
criteria,  the  theory  is  more  powerful  than  Harvey's,  although  Harvey  seems 
to  regard  his  theory  as  relating  tb  a greater  variety  of  behavior  than 
Schroder,  so  in  this  sense  Harvey's  may  be  the  more  powerful. 

The  second  reason  I prefer  Schroder's  position  is  that  his  research 
gives  the  impression  of  greater  rigor0  The  most  obvious  case  in  point  is 
the  difference  in  approach  to  the  scoring  criteria  for  the  respective  tests. 
Beyond  this  is  the  looseness  in  what  Harvey  is  willing  to  call  validational 
evidence,  plus  his  seeming  failure  to  control  for  intelligence  in  these  vali- 
dational studies o Naturally,  the  best  comparison  of  the  two  tests  would  be 
with  regard  to  predictive  efficacy,  but  this  was  not  possible  in  this  study. 

It  would  be  difficult  to  compare  the  tests  using  existing  literature,  since 
there  seems  to  be  very  little,  if  any,  overlap  in  the  actual  dependent  vari- 
ables which  have  been  investigated. 

Although  Schroder's  seems  the  stronger  position,  he  is  not  without 
problems.  At  the  theoretical  level  he  has  yet  to  elucidate  the  "connected- 
ness" concept.  As  indicated  earlier,  if  integrative  complexity  is  to  be 
distinguished  from  a dimensional  position,  it  will  have  to  be  on  the  basis 
of  connectedness,  yet  it  is  never  made  clear  what  this  connectedness  really 
is,  or  how  it  is  represented  in  the  theoretical  model. 

The  second  problem  is  the  low  reliability  of  the  PCT  (as  estimated 
by  coefficient  alpha).  The  PCT  inter -item  correlations  found  here  were 
much  lower  than  those  reported  in  Schroder  et  ad.  (1967).  Coefficient  al-’ 
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pha  will  become  larger  as  the  inter-item  correlations  increase,  but  one 
must  assume  that  the  scores  (ratings)  are  reliably  determined  if  alpha  is  to 
be  taken  as  an  estimate  of  test-retest  reliability.  Thus,  both  inter -item 
and  inter -rater  correlations  must  be  adequate. 

It  is  interesting  to  speculate  as  to  whether  the  change  in  the  applica- 
tion of  scoring  criteria  (see  Chapter  III),  since  Schroder  et  al„  (1967)  is  re- 
lated  to  the  reduced  item  intercorrelations  . Schroder's  explanation  for  the 
shift  in  distribution  of  PCT  scores  may  be  interpreted  as  implying  that  the 

i 

1967  inter-item  correlations  were  inflated  by  the  interpretations  made  by 
the  raters.  These  interpretations,  being  unsystematic,  should  reduce  inter- 
rater reliabilities,  but  the  rater  reliabilities  reported  in  Schroder  et  al. 
seem  fully  adequate  (.  80-.  95).  The  combination  of  item  and  rater  inter- 
correlations in  Schroder  et_  ah,  implies  a high  value  of  alpha  and  some  rea- 
son to  expect  that  alpha  reflects  reliability.  The  low  inter-item  correla- 

i 

j 

tions  in  the  present  study  virtually  preclude  high  reliability,  irrespective 
of  rater  reliabilities.  The  former  situation  seems  preferable.  Since  the 
changes  in  scoring  procedure  were  introduced  because  of  the  difficulty  in 
distinguishing  among  scores  at  the  complex  end  of  the  dimension,  and  since 
the  PCT  format  (two  or  three  sentences  in  response  to  a sentence  stem, 
with  a short  time  limit)  restricts  the  expression  of  complex,  abstract  con- 
cepts, the  most  obvious  first  attempt  to  solve  the  low  reliability  problem 
would  be  to  arrange  for  stimuli  and/or  time  requirements  which  would  tend 
to  draw  (or  at  least  permit)  more  complex  responses. 


Other  issues  become  problems  only  within  a context.  As  it  stands 
now,  unless  E is  willing  to  screen  large  numbers  of  Ss  in  order  to  find  the 
few  who  are  complex,  and  has  reason  to  expect  in  advance  that  his  efforts 
will  be  rewarded  (note  that  no  Ss  with  scores  of  7 or  higher  were  found  at 
Smith  and  only  2 at  UNC-C,  a complexity  rate  of  less  than  1%  for  a total  of 
nearly  300  Ss),  Schroder's  theory,  measure  and  research  are  of  limited 
value.  If  one  is  interested  in  studying  naturally  occurring  or  random  groups 
of  Ss,  as  most  of  us  are,  complexity  (defined  in  Schroder's  terms)  will  be 
either  rare  or  nonexistent.  This  problem  is  compounded  by  the  scarcity  of 
information  about  middle  complexity  Ss.  There  appears  to  be  little  concern 
for  them  in  Schroder's  work,  consequently  one  does  not  know  where  they 
stand,  or  where  to  expect  them  to  stand,  relative  to  the  extreme  groups.  It 
is  possible  that  changing  the  test  as  suggested  above  would  improve  the  dis- 
tribution  of  scores.  Learning  about  the  behavioral  characteristics  of  the 
mid-complexity  group  is  a matter  of  research.  At  least  one,  and  prefer- 
ably both,  will  be  necessary  before  the  variable  will  become  useful  in  most 
research. 

Given  these  difficulties,  the  interested  researcher  may  wish  to  turn 
to  the  dimensionally  oriented  theories  and  measures.  Of  these,  Bieri  (e.g. 
Bieri,  1966)  is  the  leader  in  terms  of  published  research,  but  Scott's 
article  mentioned  earlier  (Scott,  1969)  shows  evidence  of  an  unusual,  and 
laudable,  degree  of  concern  for  preliminary,  methodological  questions. 
Perhaps  the  major  weakness  in  Scott's  study  is  that  he,  too,  fails  to  control 
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for  intelligence.  Beyond  this,  there  seems  to  be  considerable  promise  in 
his  work. 

The  decision  to  adopt  a dimensional  definition  of  complexity  may  not 
put  one  so  much  at  odds  with  the  "-integrative  complexity"  theorists  as  might 
be  thought  at  first.  As  mentioned  earlier,  TIB  scores  correlate  with  REP 
dimensionality  (Harvey,  1966),  and  Schroder,  at  least  at  the  theoretical 
level,  will  not  become  clearly  different  from  the  dimensional  viewpoint  un- 
less "connectedness"  can  be  defined  so  as  to  achieve  the  distinction. 

The  correspondence  between  Schroder  and  the  dimensionalists  may 
potentially  be  increased  in  two  different,  but  related,  ways.  Schroder's 
complex  thinking  seems  to  involve  two  components;  first,  the  ability  to  per- 
ceive and  tolerate  alternative  interpretations  of  a situation  or  event,  this 
being  in  a sense  the  first  stage  of  complexity,  and  second,  the  development 
of  some  sort  of  abstract  basis  for  relating  and  evaluating  the  alternatives. 

It  may  be  that  the  first  of  these  could  be  fruitfully  conceptualized  as  a dif- 
ferentiation of  evaluative  judgments  (the  development  of  several  kinds  of 
"good"  and  "bad")  and/or  a reduction  of  the  correlations  between  an  "evalu- 
ative dimension"  and  other  nominally  nonevaluative  dimensions.  Either  of 
these  could  lead  to  judgments  involving  relative  value,  various  kinds  of 
value,  and  simultaneous  positive  and  negative  value,  as  well  as  relatively 
nonevaluahive  judgments. 

The  second  way  to  increase  the  relationship  between  integrative  com- 
plexity and  dimensional  complexity  would  be  to  assume  that  the  connecting 
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or  relating  component  of  Schroder's  theory  represents  superordinate  di- 
mensions, and  that  there  is  therefore  some  hierarchical  organization  of 
dimensions  , Such  a conceptualization  is  more  difficult  with  dimensions 
than  it  would  be  with  categories , For  example,  one  of  the  few  efforts  to 
investigate  a hierarchical  structure,  that  of  Zajonc  (I960),  did  not  use  di- 
mensions. It  is  tempting  to  suggest  that  these  hypothesized  superordinate 
dimensions  are  based  on  the  person's  philosophical  values,  but  if  this  is  to 
be  the  case,  distinctions  must  be  made  between  values  which  are  and  val- 
ues which  are  not  capable  of  preserving  alternative  viewpoints  , The  struc- 
tures would  be  quite  different  in  the  tv/o  cases. 

It  would  seem,  then,  that  the  insights  from  the  theory  of  integrative 
complexity  could  be  applied  to  research  on  natural  or  random  groups  of  Ss 
either  by  (a)  improving  on  Schroder's  measuring  techniques  and/or  study- 
ing middle  as  well  as  extreme  complexity  Ss,  or  (b)  by  interpreting 
Schroder's  position  in  dimensional  terms,  and  looking  for  the  differentia- 
tion of  evaluativeness  or  for  a hierarchical  organization  of  dimensions. 
Either  approach  could  provide  valuable  extensions  of  the  theory  of  cogni- 
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INSTRUCTIONS,  ITEMS,  AND  A SAMPLE  PAGE  FOR  THE 
PARAGRAPH  COMPLETION  TEST  AND  THE 
"THIS  I BELIEVE"  TEST 


i 
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! 

i 

i 

i 

On  the  following  pages  you  will  be  asked  to  write  a short  paragraph 

on  your  beliefs  about  or  opinions  of  several  different  things. 

■ 

On  each  page  you  will  find  either  the  beginning  of  a sentence  (a  word 

j 

or  phrase  followed  by  several  dots,  for  example,  "I  like  . . . ")  or  a whole 
sentence  beginning  with  "This  I believe  (for  example,  "This  I believe  about 

j 

human  nature")  „ 

i 

j • J 

If  the  beginning  sentence  is  unfinished,  complete  it  and  try  to  add  at 

I H f I 

least  two  further  sentences  on  the  same  topic,, 

i 1 

j 

If  the  beginning  sentence  is  complete,  try  to  write  at  least  three  sen- 

I I 

i - .1 

tences  on  the  topic, 

I I 

In  other  words,  in  either  case  try  to  write  at  least  three  sentences, 

' 

\ 

i 

These  answers  cannot  and  will  not  be  graded  right  or  wrong,  so 

! I 

please  write  what  you  genuinely  feel  or  believe.  Work  quickly,  but  be  as 
clear  as  possible. 

J i 

You  will  be  given  130  seconds  for  each  answer.  After  110  seconds  I 

! 
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will  say  "Finish  your  sentence.  " Make  sure  your  last  sentence  is  com- 
pleted. At  130  seconds  I will  say  "Turn  to  the  next  page,"  You  should  be- 
gin on  the  next  page  immediately. 

You  should  write  on  the  topics  in  the  order  of  their  appearance.  Wait 
to  turn  each  page  until  I give  the  signal.  Once  you  have  turned  a page,  do 
not  go  back  to  it. 

Do  not  turn  this  page  until  you  are  given  the  signal. 

Please  raise  your  hand  if  you  have  any  questions. 
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Items  for  the  Paragraph  Completion  Test  (PCT): 

When  preparing  for  an  exam  ---  (this  is  a non-scored,  warmup  item) 
Rules  - - - 

When  I am  criticized--- 
When  I am  in  doubt- -- 

When  others  criticize  me  it  usually  means--- 
Confusion--- 

Items  for  the  "This  I Believe"  Test  (TIB): 

This  I believe  about  the  American  way  of  life. 

*This  I believe  about  marriage. 

*This  I believe  about  people. 

This  1 believe  about  religion. 

*This  I believe  about  compromise. 

'■'This  I believe  about  friendship. 

*This  I believe  about  sin. 

This  I believe  about  foreign  aid. 

This  I believe  about  capital  punishment. 

This  I believe  about  world  government. 

❖ Items  marked  with  an  asterisk  were  those  used  to  determine  the 
Interpersonal  scores  (TIB-I  and  TIB-IF). 
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Sample  Page 


When  preparing  for  an  exam, 


Try  to  write  at  least  3 sentences, 


Do  not  turn  this  page  until  you  are  given  the  signal. 
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